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Despite the mnnduction of molecular methods, such as me polymerase diate reacnon, and tejteney ^.^^^ 
microsaSlite markers, genotypiog remains a rate lirnitmg factor in oux abiliry to localize disease genes by lintege. 
a method for *morynto 

to %£^^ techniques developed will be useful « high tesdution genomic analyses ^^^f 

studies, cancer genetics, forensies, and cytogenetics including studies of uniparental disomy or other patterns of chromosomal inheritance. 
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GENOTYPING BY SIMULTANEOUS ANALYSIS 
OF MULTIPLE MICROSATELIJTE LOCJ 

The work leading to this invention was supported in part by Grant No. GM 47145 from 
the National Institutes of Health. The United States Government may retain certain rights in this 
invention. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

This invention is directed to semi-automated methods for linkage mapping of the genome 
by genotyping of multiple microsatellite loci. 
Summary of Background Information 

Far most genetic disorders, there is no known biochemical defect. Consequently, the 
mutant genes associated with the disease and their disease-causing abnormal gene products are 
recognized solely by the anomalous phenotype they produce. Identifying the chromosomal 
localization for the gene(s) that produce these disease phenotypes is often the first crucial step 
toward isolation and characterization of the mutation(s) by recombinant DNA techniques. 
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The significance of mapping a gene is perhaps better appreciated when put into context 
with the human genome project Consider for a moment that even after every base of the DNA 
in the entire human genome has been sequenced through the Human Genome Initiative (HGI), 
and every gene has been localized in this sequence, it may still not be clear which disorders) 
arise from which gene(s). Each disease phenotype will still need to be "mapped" or «««~iiitH 
with a particular location in the genome. This is usually carried out by analyzing DNA isolated 
from blood specimens collected from individuals within families affected by a genetic disorder. 
Once a disorder or abnormal phenotype has been linked to a particular region on a chromosome, 
the limited number of genes within this area will permit us to suggest a eanrfiriatf gene that can 
contribute to the phenotype. Thus, once the localization of a major disease phenotype to a 
chromosomal region is confirmed, a few candidate genes can be examined for mutations as well 
as potential pathogenic rvrhflnim m , 

If ho genes have been mapped to the region, then linkage studies with closely- spaced 
surrounding markers can often be used to rirlmfatr a large chromosomal interval (1-2 Mb) in 
which to search for transcribed sequences. This approa ch (originally termed "reverse genetics") 
is now generally referred to as "positional cloning*. In the past the isolation of candidate genes 
from these large genomic regions was the rate-limiting step in positional cloning, requiring yean 
of intensive work. However, recent improvements in methods to capture expressed sequences 
encoded within large genomic segments have been described. Thus, there is now a need for 
advances in the molecular genetic methods employed in the linkage mapping of disease genes. 

The chro mo so m es are the basic units of inheritance on which genes and DNA markers 
are organized in a linear fashion (see Figure 1). Linkage is evident when a gene(s) that 
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produces a phenotypic trait, or a significant portion of the trait, and the surrounding DNA 
markers are inherited together (cosegregate at meiosis). In contrast, those markers that are not 
a<yv-ifltffrf with the anomalous phenotype of interest will be randomly distributed among affected 
family members as a result of the independent assortment of chromosomes and crossing over 
during meiosis (see Figure 2, compare "A" markers to "B"-*F" markers). 

In general, the further a marker, or gene, is from the genetic locus of interest (for 
example, markers 1 and 4 as co mpar e d to markers 2 and 3 in Figure 1), the more likely they 
will be separated by crossing over at meiosis. The recombinant genotypes produced by crossing 
over between maternal and paternal chromosomes at meiosis allows us to predict the ordering 
of genes and markers through the interval under examination. Recombination between the 
markers 1A and 3A, and 2A and 4A in the affected members in Figure 2, suggest that the 
mutant gene of interest lies between markers 1 and 4. Thus linkage to a marker of known 
chromosomal location allows placement of the phenotype on the chromosomal map. 

Analysis for testing linkage with use of DNA markers is based on standard likelihood 
theory. The DNA markers are used to recognize each of the parental chromosomes. Recall that 
in general each chromosome is ™K*riftvf independently of any other; and the likelihood of 
inheriting either chromosome of a pair from each parent is 50:50. Therefore, when a marker 
is unlinked to the gene(s) producing an anomalous phenotype, one expects both the maternal and 
paternal chromosomes to be equally distributed in the affected offspring. 

Linkage in the human is established by the method of likelihood ratios (see Ott, 1992 
"Analysis of Human Genetic linkage," The Johns Hopkins University Press, Baltimore, for a 
review). One compares the probability that observed family data, such as that in Figure 2, 
would arise under one hypothesis (for instance, linkage with no recombination with marker 2 
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or 3) to the probability that it would arise under an alternative hypothesis (typically, noalinkage). 
The ratio of these probabilities is called the odds ratio for one hypothesis relative to the other. 
By convention, mammalian geneticists prefer the log of the odds ratio, or the lod score. 
Generally, linkage is considered proven when the odds in favor of linkage versus nonlinkage 
become overwhelming, or reach 1000:1 (LOD « 3) (see Morton, 1955, Am. J. Hum. Genet., 
7:277-318). Linkage is rejected when the odds drop to 100:1 against this hypothesis (LOD = - 
2). The maximum likelihood estimate is the recombination fraction where the likelihood ratio 
is largest. Lod scores from multiple pedigrees are thus added until the score grows to 3 
(signifying 1000:1 odds) or Ms to -2 (indicating 1:100 odds). Linkage can be easily evaluated 
using likelihood ratios, even in complicated pedigrees, by testing on the computer for tfa nf 
competing hypothesis. Recently, additional strategies have been devised that can handle genetic 
heterogeneity more effectively (Oh, 1974, Am. J. Hum. Genet. , 26:588-597) as well as disorders 
caused by multiple genes, (Lander, et aL, 1986, Proc Natl Acad. Set USA, £2:7353-7357). 
GtnotYPing With Molecular Gametic Methods 

The descriptions of many types of DNA sequence polymorphisms have provided the 
fundamental basis for our understanding of the structure of the mammalian genome (CEPH 
consortium map/ 1992, Science, 252:67-86; Weissenbach et al., 1992, Nature, 352:794). Hie 
construction of extensive framework linkage maps has been greatly feffiw*at*H by the use of these 
DNA polymorphisms, and has provided a practical means for the localization of disease genes 
by linkage. The process of linkage mapping in Mendeiian and complex disorders using t tw 
techniques has been further facilitated by the recent description of a detailed 'second-generation" 
linkage map of the human genome (Weissenbach et aL, 1992). In particular the recent 
description of highly polymorphic PCR-based microsatellite markers far genotyping has greatly 
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advanced the construction of high resolution linkage maps (Weber and May, 1989, Am. J. Bum. 
Genet., 44.:388-396; Litt and Luty, 1989, Am. J. Hum. Genet., 44:397-401). 

The microsatdlite markers are highly polymorphic, simple sequence repeat (SSR) 
markers, generally defined as repeats of 6 bp or less running in tandem for up to 100 bp long 
(Beckmann, et ah, 1992, Genomics, 12:627-631). These repeat sequences are flanked by unique 
DNA sequences that may be identified for each marker location. With primers that correspond 
to the unique DNA sequence surrounding each marker, the polymerase chain reaction (PCR, see, 
e.g., Saild, et aL, 1988, Science, 222:489) can be used to detect each polymorphism. 

This type of genetic marker is abundant and found throughout the genome. SSR may be 
ajfreque«asor*every6kb(Beckaa^ 1992). Where SSR markers show considerable 
polymorphism (differences in the number of repeats) between individuals, the markers can be 
particularly infbnnative. Many such SSR markers have been isolated throughout the genome, 
and are well mapped (Weissenbach, et aL, 1992). Many of these SSR markers are now 
available commercially for linkage studies (e.g., from Research Genetics, Huntsville, AL). 
Those markers which frequently allow the investigator to identify each parental chromosome as 
unique and to identify each crossover rapidly (see Figure 2) approach the ideal for linkage 
studies. 

Most SSR are (GT), dinucleotide repeat length polymorphisms (see Figure 3). It is 
cstiinated that there are ^ 100,000 of the (GT). type SSR, or one approximately every 30 
kb (Beckmann, et al, 1992). Over 1,000 SSR markers have been described to date in the 
Genome Data Base, October 19, 1993, The Johns Hopkins University, Baltimore, Maryland, 
and thousands of additional markers are now in development 
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It is now well accepted that methods based on the polymerase chain reaction (PGR) and 
highly polymorphic simple sequence repeat (SSR) markers (e.g. Figure 3) are the techniques of 
choice for genotyping in linkage studies (Weber, et al. t 1989; Iitt, et al., 1989; Edwards, et al. 
1991, Am /. Hum. Geneu, i£746-56). FCR-based methods are faster and therefore less costly 
than restriction fragment length polymorphism (RFLP) methods; moreover, they do not require 
nucleic acid probes, and are more informative in linkage studies. Efforts are underway to 
develop automated techniques for genotyping that will further improve the efficiency of linkage 
studies utilizing this type of microsatellite markers polymorphism. The advantages of analyzing 
multiple polymorphic loci using an automated DNA sequencer were first described by Skolnick 
and Wallace in 1988 (Genomics, 2:273-279). Building on techniques reported by Connell, et 
al. (1987, Biouctmiques, 5:342-348), Zegleet aL, (1992 Genomics, 14:102^1031), extended 
this approach to incorporate automated DNA sizing technology for genotyping microsatellite loci 
using four color fluorescence-based techniques. 

However, the analysis of microsatellite markers still relies on gd electrophoresis winch 
has limited sample handling capacity. Furthermore, the gel electrophoresis of DNA fragments 
is complicated by problems with gd distortion, such as band shifting that warrant internal size 
standards and bandmatching software (Lander, 1991, Am .7. Hum. Genet, 42:819-823). 
Crosstalk or interference during analysis between multiple dyes with spectral overlap is another 
potential problem when multiple FCR fragments of the same size are to be identified within the 
same gd lane. Since the processing of gds and the scoring of autoradiographs remains the 
rate-limiting step in genotyping, methods are being sought that improve the efficiency of sample 
handling while minimizing er ro rs in data transcription and analysis. 
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The challenge of mapping the major genes in complex disorders requires efficient and 
highly accurate methods of genotyping. Recent technological enhancements in molecular 
genetics have significantly improved our ability to locate disease genes by linkage analysis. 
However, despite the introduction of molecular methods, such as PCR, and the discovery of 
highly polymorphic SSR, genotyping is still rate-limiting for localizing disease genes by linkage. 
Hie present methods remain highly l m * t v ira \], time-consuming, and expensive. 
SUMMARY OF THE INVENTION 

It is an object of this invention to provide a robust semi-automated protocol for 
genotyping using mulfiplfflr analysis of many mic ms a triHfr Ind white inaintatmwg < ^ improvi ng , 

typing accuracy as compared to traditional methods. It is also an object of this invention 
to provide a collection of highly reproducible microsalellite markers at approximately 10-50 cM 
intervals throughout the human genome which can be detectably-labeUed. 

It is a further object to provide protocols for the reliable use of these marker systems in 
automated genotyping. 

To meet these and other objects, and to better exploit the inherent advantages of 
fluorescence-based genotyping techniques, this invention provides highly informative SSR 
markers, assembled into "SETS" that do not overlap in size when separated electrophoretically 
on an acrylamide gel and that can be labelled with different fluorophores. Each SET contains 
6 or mare pain of primers that provide for amplification of markers (preferably 7-8 pain of 
primers) that have been labelled with the same fiuorophore having a distinct color, separate 
SETs having different fluorophore labels (eg., blue, green, or yellow). PCR products 
corresponding to these SETS are combined into a GROUP for electrophoretic analysis in a single 
lane. Using this methodology, a GROUP of 18 or more, preferably 21 to 24 dinudeotide 
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markers can be electrophoresed along with an internal size standard and analyzed simultaneously 
(multiplexing) in real-time for each individual studied. 

In particular, the invention provides a kit for use in automated genotyping within a 
population comprising four or more GROUPS, each GROUP containing at least three SETS, and 
5 each SET in turn comprising at least 6 labelled pain of primers for amplification of DNA by 
polymerase chain reaction (PGR), the sequence of each primer pair corresponding to a portion 
of the unique genomic sequence of a microsatellite sequence (which is made up of a nucleotide 
repeat sequence flanked by unique sequences), the nucleotide repeat sequence being polymorphic 
within the population* Amplification of DNA from a human sample by the polymerase chain 

10 reaction (PCR) primed with a particular primer pair amplifies the nucleotide repeat sequence and 
at least some of the immediately adjacent unique sequences of the microsatellite sequence to 
produce a PCR product identified with the primer pair. The distance in the genome between the 
microsatellite sequence amplified by one primer pair of the kit and the nearest other 
microsatellite sequence amplified by another primer pair of the kit is at least 2 centimorgans 

15 (cM) and no more than SO cM. Each SET consists of at least 6 of the primer pain, where the 
length of the segment amplified by a particular primer pair (its PCR product) diffen from the 
length of PCR products from all other primer pain in the SET by at least 5 nucleotides for 
tetranudeotide repeats, at least 6 nucleotides for trinucleotide repeats and at least 9 nucleotides 
far dinudeotide repeats. At least one primer of each primer pair is labelled with a fluorescent 

20 label that is the same for all primer pain in the SET. Each GROUP consists of at least three 
SETS of primer pain labelled with fluorescent labels, and primers from one SET in the GROUP 
are labelled with a fluorescent label which fluoresces at a wavelength which is substantially 
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different from the wavelength at which the fluorescent labels on the primers in each of the other 
SETS in the GROUP fluoresce. 

Where the primers in a single lot cover the entire genome with markers spaced 
approximately 10 cM apart in the genome, the kit will usually contain at least about 10 
GROUPS. In another embodiment, a kit is provided for screening of the genome with individual 
markers spaced in the genome about 50 cM from the nearest other marker in the kit, and the kit 
contains at least 4 GROUPS. The invention also provides kits containing fewer GROUPS with 
primers whose PCR products identify microsatellite sequences found in the genome T pTil 
closely about the locations picked out by screening studies performed using the screening kit 

The invention also provides a method of analyzing genomic DNA for the presence of 
polymorphisms comprising: extracting DNA from a human sample; combining, in a polymerase 
chain reaction (PCR) vessel, an aliquot of the extracted DNA, at least one primer pair «"'»-tH 
from one of the GROUPS described above, and PCR amplification enzymes; cycling the 
teniperature of ead» ra vessd to prc^ 

pair whose sequence corresponds to unique sequence in the amplified DNA, using an annealing 
ternperamre at which non-specific annealing is rmninrized; then combining all PCR products 
from all PCR vessels remaining primer pairs from a single GROUP into a mixture, and 
subsequently separating the mixture of PCR products dectropboretically by size; and detecting 
separated PCR products by fluorescence detection at wavelengths corresponding to the 
fluorescent wavelength for each of the fluorescent labels in the kit In a preferred embodiment, 
one primer of each primer pair is labelled with a fluorescent label and the other primer in the 
pair is labelled with biotin, and a mixture containing all PCR products corresponding to the 
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primer pain from a single GROUP is prepared by binding the PCR products to a plurality of 
paramagnetic beads carrying on their surface a protein which specifically binds biotin (the beads 
being added to each PCR vessel after amplification), separating the magnetic beads from the 
PCR reaction medium, then separating the two strands of the amplified DNA segments and 
combining the strands labelled with a fluorescent label for all primer pain from one GROUP 
into the mixture. 

The invention also provides a method for idfcting a SET of PCR primers for use in 
automated genotyping comprising selecting at least 6 microsatellite sequences, which contain di- 
nudeotide, trinucleotide or tetranudeotide repeat sequences that are flanked by unique sequences 
in the human genome, and are polymorphic within the population, the microsatellite Tinr-s 
being separated from each other by at least 2 centimorgans in the genome, and for each 
mkrosatelHte sequence constructing primer pain having the sequence of the unique sequences 
flanking the m icrosatrltite sequences, so that the primer pain will direct PCR amplified™, of 
DNA segments corresponding to each microsatellite sequence and the length of all polymorphs 
of the microsatellite sequence amplified by a partiailar primer pair is detectably different from 
the length of all polymorphs of other microsatellite sequences amplified by other primer pain 
in the SET. The invention also provides a kit for use in automated genotyping comprising at 
least 10 GROUPS of at least 3 SETS of PCR primers obtained by this method, and a method 
of analyzing genomic DNA for the presence of polymorphisms comprising amplifying DNA 
extracted from a human sample using PCR directed by these primer pain to produce PCR 
products labelled with detectable labels that are the same for all PCR products from a single 
SET, followed by separating dectrophoretically a mixture containing all PCR products amplified 
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from the ONA sample by any primer pair of said SET and characterizing the detectabiy labelled 
PCR products by length. 

The invention also provides a diagnostic method for detection by polymerase chain 
reaction of genomic rearrangement (including deletions, additions, crossovers and gene 
a mplifi ca tio n), of a genomic region containing at least 6 known loci at which genetic 
rea rr a ng ement is diagnostic for a disease, using a kit comprising at least one SET containing at 
least 6 PCR primer pairs, the sequences of each primer pair corresponding to the unique 
seque nc es flanking one of the loci of genomic rearrangement The primer pairs in the SET are 
constructed so that the PCR product amplified by a particular pair of primers corresponds to a 
DNA segment surrounding one locus of rearrangement with length that is characteristic of a 
specific rearrangement, and the length of the PCR products amplified by a particular pair of 
primers differs from the length of all other PCR products amplified by other primers in the SET. 
DNA from a sample is amplified in a PCR vessel using the polymerase reaction (PCR) 
primed with at least one of the primer pairs of the SET by cycling the temperature of the vessels 
with an annealing temperature that minimizes non-specific annealing to produce detectabiy 
labelled PCR products, and the PCR products for all primer pairs in the SET are detectabiy 
labelled with the same labeL Labelled PCR products are separated electrophorctically by size 
from a mixture containing all PCR products amplified from the DNA sample by any primer pair 
of the SET, and the separated, detectabiy labelled PCR products are characterized by length. 
In a preferred mode, all primers in the SET have annealing temperatures within a 4C range, and 
amplification for all primers in the SET is carried out simultaneously in the same vessel. 

The inventor has created a kit comprising SETS of highly polymorphic fluorescent 
primers specific for nricrosatellite markers that cover the genome at approximately 10 cM 
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intervals for linkage studies. A fluorescence-based protocol based on these SETS has been 
developed for detection of multiple microsatellite markers, and the protocol is accurate as 
compared to a conventional radiolabeling method that depends on a known DNA sequence ladder 
and conventional autoradiography for detection. It has now been demonstrated that genotyping 
by semi-automated fluorescence-based techniques is both highly accurate and efficient We 
routinely type 24 fluorescent markers simultaneously using these frrhniqucs in my laboratory. 
The combined analysis of 24 dinucleotide markers in a single gel mayimiyfi the use of 
automated analysis equipment, such as the Applied Biosystems 373A hardware, by producing 
FCR products sufficiently small to run the instrument at least twice daily. The methods 
provided herein may improve productivity by more than an order of magnitude and can be easily 
adopted to most linkage studies. 
BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows the genetic map of the chromosomal region surrounding a putative 
GENETIC locus. In this example the greater the spacing between markers the more likely 
recombination will occur during mciosis. 

Figure 2 shows segregation data from a fabricated three generation family affected with 
a genetic disorder for the four markers illustrated in Figure 1. Squares indicate males, circles 
indicate females. Affected and unaffected family members are indicated by solid and open 
symbols, respectively. Crossovers that have occurred during meiosis are indicated by the 
arrowheads. Recombination with markers 1 and 4 from chromosome A exclude a localization 
for the gene causing this disorder in the region immediately above marker 1 and below marirgf 
4. The region from chromosome A between markers 1 and 4 (including markers 2 and 3) co- 
segregates with the abnormal phenotype in all the affected individuals in this family but is not 
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found in any unaffected individuals. These data confirm a localization for the GENETIC locus 
under study to this chromosomal region. 

Chromosomal region 4 of chromosome B from affected individual 1-1 occurs in both 
affected and un a ff ec t ed offspring in generation II, showing no linkage. The markers used in this 
demonstration approach the ideal by providing maximal genetic information for every individual 
studied. 

Figure 3 illustrates the most common form of simple sequence repeat In this individual 
the marker is heterozygous, or differs in the number of dinucleotides between the maternal and 
paternal chromosomes. These PCR products would differ in length by 8 nucleotides, and are 
each easily detected using gel electrophoresis. The solid ban indicate surrounding sequent that 
is unique (occurs only once in the human genome) and can be used to design PCR primers for 
amplifying this simple «rn»nff repeat. 

Figure 4 shows a cartoon of GROUP 1 T=*nh sm&c sequence repeat "»*t 

is identified on the left, and the size range for known alleles are noted on the right Each 
marker coven a region of a chromosome to be examined for linkage with a genetic disorder. 
The colored boxes refer to the region on the gel where alleles for each marker may be found. 
The markers are chosen to avoid overlap between these regions. For increased efficiency each 
SET is labelled with one of three fiuorophores - yellow: tetramemyl-6<arboxy-ihodaniine 
(TMR), blue: 5sarboxy-fluorescein (FAM), and green: 2\7'-dimemoxy^\5'-ciichIoro-6- 
carboxy-fluorescein (JOE); (red o^carboxy-rhodaniine (ROX) is reserved for internal size 
standards), Applied Biosystems. The products of the PCR amplifications are pooled and 
subjected to the dectrophoresis together. Marker data are derived from the Genome Data Base 
(GDB), The Johns Hopkins University, Baltimore, Maryland. 
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Figure 5 shows a typical set of electrophoretograms for GROUP 2 using DNA from a 
single individual. 

Figure 6 shows an eiectrophoretogram of SET A, GROUP 1 markers from one 
individual The size (nucleotides) of each PCR product is given on the X-axis above the 
eiectrophoretogram. 

Figure 7 A-M provides a listing of the markers in 13 GROUPS each containing 16-24 
markers divided into three SETS, Hie first column gives a locus designation for the marker to 
identify the entry in the Genbank Data Base which provides the unique sequences surrounding 
the markers. The unique sequence information can be used to design primers that will direct 
PGR amplification of the marker. After the locus designation, the size range of the published 
alleles (in base pairs), the degree of h ete r ozygosi ty in the population and the chromosomal 
location are listed, in that order, for each marker followed by the nucleotide sequences of 
preferred primer pain, along with their annealing temperatures and preferred choice for labelled 
primer. 

Figure 8 demonstrates the difference in autoradiographic image produced depending on 
whether the forward or reverse primer is labelled. 

Figure 9 shows an autoradiograph of PCR-amplified DNA using the primers of GROUP 
2, SET B. The variation in intensity in products of this SET is typical of this type of marker. 

Figure 10 shows the effect of varying the amount of paramagnetic beads in a magnetic 
bead-based recovery from PGR. 
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DET AILED DESCRIPTION OF THE INVENTION 

Methods for sequencing DNA, for synthesizing oligodeoxynucieotides of defined 
sequence, and for separating nucleic acid segments by molecular weight using, e.g., 
electrophoresis are well known to those skilled in the art and well described in the literature, in, 
for example, "Molecular Cloning: A Laboratory Manual," Sambrook, et aL, eds M Cold Spring 
Harbor Laboratory Press, 1989. General methods of analyzing DNA by the polymerase chain 
reaction (PCR) including isolation and preparation of DNA templates, synthesis and labelling 
of primers, amplification, and analysis of PCR products are also well known and described in 
the literature, for example in Sambrook, et aL, 1989, or in "PCR Protocols: A Guide to 
Methods and Applications," Innis, et aL, eds., Academic Press, 1990. The skilled worker in 
this art is familiar with these and other methods of manipulating and analyzing DNA, and 
routine application of such methods within the skill of the ordinary skilled worker is assumed 
in the following description. 
Semi-Automated Genotyping: 

Despite the improvements in linkage techniques introduced by PCR and SSRs, genotyping 
r emains highly technical, time consuming, and expensive. The application of fluorescence-based 
technology is one way to further reduce the cost and increase the efficiency of this type of 
project. Fluor es c en t labeling of PCR-based markers provides many potential advantages over 
radio-labels (eg., *P) and other labels in common use for PCR markers. Fluorescent labels are 
nontoxic, stable, and can be combined and analyzed together in a single dectrophoretic lane 
(multiplexing) to provide a many-fold increase in efficiency over standard methods of detection. 
Fluorescence signals are linear over a much greater range of intensity than conventional 
autoradiography and other methods of detection in use, providing a better means of 
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distmguishing between alleles and artifact. Band intensity provides an objective method for 
distinguishing between alleles and artifacts and may also provide a better means for identifying 
the products of microsateUite markers that frequently vary significantly in intensity. 

Ultimately, real-time fluorescence detection methods may provide a substantial increase 
in efficiency over standard methods of detection based on radio-labeling. A much larger range 
of product sizes can be resolved on each gel run a compared tn raHHabrH n g Mrhtw<yu»T H*f*n*» 
with the automated, real-time equipment such as the Applied Biosystems Inc., the PCR products 
pass by the detector toward the bottom of the gel where the band resolution is greatest 
Efficiency is further improved by the potential real-time semi-automated detection of »'VH 
In addition, internal size standards are easily incorporated for reproducibility and the accurate 
sizing of alleles, avoiding day to day variability: Computerized data acquisition and handling 
further aid productivity and reduce errors in data entry and mampulaticn. Ultimately, 
automation is likely to occur more rapidly with fluorescence-based techniques then with other 
methods of labeling and detection. 

As an initial test of the fluorescence technology, a study was conducted comparing the 
accuracy and reliability of these methods with n P end-labeling (see Example 1). Tnree markers 
were chosen because they produce PCR products of the same size range. Products of PCR 
reactions run wimrmrrra complementary 

these markers were obtained using primer pairs in which one primer of each pair was conjugated 
to a fluorescent label. These PCR products were electrophoresed simultaneously in a single 
dectrophoretic lane to test if these genotypes could be accurately Determined. Similar to me 
report by Ziegel,et at, 1992, there was no difficulty in discerning PCR fragments of the same 
size labelled with different fiuorophores. 
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Determining the size of DNA fragments accurately is critical to geao typing in a number 
of applications. When parental alleles are available, a simple comparison can determine which, 
if either, parental allele has been passed on to a child. However, frequently in linkage studies 
the parental all e l e s are not available for comparison, and paternity must be questioned. This is 
also true in DNA forensics, where an unknown must be compared with many others and its size 
determined unambiguously. The analysis of PCR products that differ grossly in concentration 
is complicated by bandshifting and other gel related artifacts. The accuracy of this typing 
procedure must be based on empiric studies of reproducibility 'known' samples as 
standards. Non-polymorphic internal size standards can be used to remedy these problems 
(Lander, 1991). 

Example 1 demonstrates the accuracy of sizing microsateilite PCR products using a 
fluorescence-based approach as compared to a conventional radiation-based method using a 
known sequence ladder. DNA templates may be obtained from the collection of Centre d'Etnde 
du Polymorphisme Humaine, Paris (CEPH) for use as a standard set of alleles to compare these 
techniques, because there is little question of the genetic identity of each of the individuals in 
this collection. To avoid ambiguity in genotyping with the fluorescent method, factional size 
estimates should preferably be accurate to within 0.5 nucleotides. Variation greater than this 
could lead to confusion during band notching, after rounding up or down for size estimates 
provided as a fraction of a nucleotide. Since our analysis suggests that the maximum variation 
is likely to be less than 0.5 nucleotides (and generally significantly less), the method will be 
useful in the intended applications. 

As shown in Example 1, no sizing errors occurred with the use of the multi-color 
fluorescence-based technique, showing that this methodology is highly accurate and reproducible 
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for scoring microsalellite markers. Since the only sizing error resulted from the use of the 
conventional radiolabeling technique/ the fluorescence-based protocol appears at least as accurate - 
as the conventional method. Therefore, this approach appears to adequately compensate far gel 
distortion and dye related artifacts as compared to radiation labeling techniques. 

Accordingly, the advantages demonstrated for fluorescence-based techniques may be 
exploited by the method of this invention, which uses at least 6 highly informative SSR markers 
assembled into a ladder which we have designated a "SET". Each SSR marker is characterized 
by PGR primer pairs which have the same sequence as a portion of the unique DNA fT?mrr 
on the 5' side of the sense and antisense strands, r espe ctiv ely, encoding the repeat sequence at 
a particular point in the genome. When the genetic material of a particular individual is 
amplified by PCR using one of these primer pain, a segment of DNA corresponding to the 
sequence of the particular SSR and its unique flanking sequences is produced (the PCR product). 
The size of the PCR product is dependent both on how much of the unique sequences are 
covered by the primers in the pair and on the number of times the repeat sequence is repeated. 
The number of repeats of the simple sequence at a particular locus varies between individuals 
(polymorphism), and this polymorphism results in PCR products of varying size for different 
individuals. Thus the size of the PCR product can be used to determine if two individuals have 
an allele in common at the genetic locus of the SSR marker. 

The spacing in the gel between PCR products identified with different markers is critical. 
By carefully selecting the length of the primer sequences for each marker, the PCR products 
corresponding to each marker in a SET arc spaced a critical distance from surrounding markers 
such that none of the PCR products for the largest known alleles of one marker overlap in size 
with PGR products for the shortest known alleles of another marker in the SET when separated 
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on a 6% denaturing acrylamide gd. An additional safety margin should be provided, because 
rare undocumented alleles (larger or smaller) may occur for any given marker. Size spacing of 
less than 9 nucleotides between dinucleotide SSR markers increases the likelihood for overlap 
because 2-4 stuttering bands (each 2 nucleotides apart) below the smallest of one marker 
may overlap with the largest allele of the marker below it PCR products for trinucleotide 
repeat sequences and tetranucleotide repeat sequences are not observed to exhibit *tr^*^«g 
bands, so the minimum separation distance above and below the largest and smallest known 
alleles can be less for tri- and tetranucleotide repeats. Usually, PCR products for trinucleotide 
repeats in a SET will differ by at least 5 base pairs, and for tetranucleotide markers by at least 
6 base pans. Preferably a SET will contain 7-9 SSR markers, most preferably 8-9 markers. 
The upper limit on the number of markers in a SET is dependent on the length of the 
electrophoretic separation. 

The PCR product of each primer pair in the SET is tagged with the same label, 
preferably a fluorescent dye. Usually a fluorescent label is covalently attached to one of the 
primers in a primer pair. Alternatively, the PCR product may be uniformly labelled by adding 
one or more fluorescently-labellai nucleoside triphosphates to the PCR reaction. Labelling of 
the primers may be accomplished by including a fluorescently-labelled nucleotide during 
synthesis of the primer or by linking a fluorescent label to the primer after synthesis. 
Fluorophore labels for attachment to nucleic acids, including PCR primers, are readily available 
in the art (See, e.g M Nagaoka, et aL, (1992) Chan. Pharm. BuB,, 4Q:2559-2561; Giusti, et 
tL. (1993) PCR Methods Appl. , 2:223-227; Alexandrova, etal., Nucleic Acids Symp. Ser. 1991, 
p. 277; Schubert, et aL, (1992) DNA Seq., 2:273-279; Vu, et aL, (1990) Tetrahedron Lett., 
21:7269-7272.) Usually the labels contain coupling groups that react with modified nucleotides 
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of the PCR primen to form covalcnt links. Attaching such fluorophorcs to the primers in the 
SETS of this invention is easily within the skill of the ordinary worker. See, e.g., Levenson 
and Chang, 1990, "Nonisotopically labelled Probes and Primers/ in PCR Protocols, Innis, et 
aL, eds., Academic Press, NY. Fluorescent labels with non-overlapping emission spectra are 
also available commercially, for example, from Applied BioSystems, Inc., including 5-carboxy- 
fluorescein (FAM-blue), 2\7 , dimethoxy-4\5'siichloro- (JOE-green), 
N,N,N*,N f -tetramethyl*^<arboxy-rhodaraine(T^ 

red); from Biological Detection Systems, Inc., Pittsburgh, PA (BDS) including nucleoside 
triphosphates coupled to cyanine dyes that fluoresce in the green or orange region, or Boehringer 
Mannheim Corporation Biochemical Products, Indianapolis, IN, including fluorescein-5(6)- 
carboxanridocaproxyl-dDTP (yellow), 7-hydracy-coumarin-3-car^^ (blue), and 

tetramethylrhodamine-5(6^^ (red). 

Additional suggestions for selecting labels with non-overiapping fluorescent sp e ctra and 
derivitizing oligonucleotides, with them can be found in Smith, et aL 1986, Nature. 122:674- 
679, incorporated herein by reference. Alternatively, primers (or PCR products) may be 
labelled with biotin (see, e.g. f Innis, et aL, "PCR Protocols," Academic Press, NY, 1990, pp. 
100-103) and then streptavidin coupled to a particular fluorescent dye added to all of the PCR 
products of a particular SET. Variations of these labelling methods or similar methods known 
to those skilled in the art may be used, so long as all PCR product for markers in one SET are 
labelled with the same label. 

SETS, each labelled with a different fluorophore, can be pooled into a collection of 
markers that we have termed a "GROUP." The number of SETS in a GROUP will depend on 
the availability of distinct labels. PCR products for each SET in the GROUP will usually be 
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labelled with fluorophores that emit light at a wavelength substantially different from the 
wavelengths emitted by fluorophore labels of the other SETS in the GROUP, where 
'substantially different" means sufficiently distinct to be distinguished by the detection means 
chosen for detecting PCR products after electrophoresis. For example, three commercially 
available fiuorophores, referred to as TMR, FAM, and JOE (Applied Biosystems), have 
different colors which are yellow, blue, and green, respectively. 

Using this approach we have analyzed as many as 24 SSR markers in a «in g j f 
electrophoretic lane using three distinct fluorescent labels to label three SETS in the GROUP 
(see e.g. Fig. 4). Ia a preferred mode, these fluorescent PCR products may be separated on an 
automated electiupuoiesis systems, such as the Applied Biosystems 373 sequencer with internal 
size standards in each iane (labelled, for example, with ROX (red dye), Applied Biosystems) and 
analyzed using, e.g., GeneScan 672 software (Applied Biosystems) (ZSegle, etaL, 1991, Miami 
Short Rep., 1:70) and scored using GENOTYPER software (Applied Biosystems), with data 
displayed as an ele^facfetogram or in a spread sheet format Gd band fluorescent intensities 
and peak areas provide an objective method of distinguishing alleles from artifact (stuttering 
bands). A typical dectrophoretogram from a single individual for SET A GROUP 1 is 
illustrated in Figure 6. 
Marker Selection and Development: 

The human genome is estimated to be approximately 3000 cM in length. Therefore, to 
adequately 'cover- the entire genome at 10 cM intervals will require approximately 300 highly 
informative well spaced markers. An alternative estimate obtained by summing the mriotic 
maps from all the chromosomes suggests that the genome is approximately 5000 cM in length 
(NTH/CEPH Collaborative Mapping Group, 1992, Sdaux, 222:67-86). Adequate 'coverage- 
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of the entire genome based on this size estimate at 15 cM intervals (which would allow testing 
for linkage without using a prohibitively large number of families) will require about 333 highly 
informative well spaced markers. 

Characteristics of preferred markers can be summarized as follows: unique sequence 
surrounding the marker is available for use in designing primers, they have been sized 
accurately, the heterozygosity value is known, and each marker has been carefully localized. 
Over 1000 SSR markers, including the surrounding unique sequence and chromosomal location, 
have been described to date in the Genome Data Base (GDB), October 19, 1993, Hie Johns 
Hopkins University, Baltimore, Maryland. In contrast to older approaches, such as RFLP, many 
of the preferred SSRmaxken are heterozygous (aMes differ at a particular locus) > 50% of me 
time and therefore are highly informative for linkage studies. Each allele of the markers used 
in the method of this invention will be easily detectable after amplification by PCR as a 
predictable component of a complex image or signature by 5' end labeling with «P, labeling 
with fluorescence, or by a variety of other methods. Most preferably, the markers also produce 
an easily scored product or simple pattern of stutter bands that are the signature of 
mononucleotide and ttinn^Ttr titte repeats. 

Most dinudeotide repeats produce two or three smaller less intense products or "stutter 
bands- (Weber, 1989). TTiese are artifacts produced during PCR, and are less common in PCR 
of tri-andtetmudeotide repeats. Although these stutter bands have been generally considered 
undesirable, they can be quite helpful to the investigator (or computer) during the scoring of 
genotypes by allowing for the identification of "false' bands (background bands due to non- 
specific annealing). Each allele can then be easily scored by 5' end labeling with "P or 
fluorescence after amplification by PCR, as a predictable component of a complex image. 
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Background bands are generally not awviatrri with stuttering artifacts. Because artifacts due 
to nonspecific annealing are difficult to eliminate entirely from a PCR reaction, the adaptation 
of a similar protocol for the multiplex semi-automated genotyping of tri-, and tetxanudeotide 
repeats may be more problematic. The method of this invention reduces artifacts due to non- 
specific annealing by control of the annealing temperature for respective primers during 
temperature cycling. 

Hie use of dinucleotide SSR is preferred in the method of this invention, because the 
potential advantages for automated genotyping may not be so easily incorporated into practice 
for mono-, tri- and tetxanudeotide repeats. PCR products of trinucleotide and tetranucleotide 
repeats lack the unique "stuttering* signature of dinudeotide repeats, making it difficult for the 
computer to distinguish real alleles from artifacts produced by nonspecific annealing during 
PCR. Although a simple set of PCR products are produced as alkies (little or no stuttering) 
from tri- or tetranucleotide SSRs, it is often difficult to eliminate other PCR artifacts completdy . 
These PCR artifacts are not easily distinguished from "false" bands when large numbers of PCR 
products that vary significantly in intensity are combined as described by this method. The 
unique signature derived from the stuttering bands of dinudeotide repeats provides a simple 
means of distinguishing real products (alleles) from artifactual bands. 

Furthermore, the cost of the hardware is generally considered the limiting factor when 
adopting the fluorescent approach. Tri- and tetxanudeotide markers generally require a 
significantly larger fraction of each gd because alkies span a much larger size range. Thus 
longer run time is required, and fewer markers can be resolved per geL The cost of the 
hardware becomes readily affordable if one considers the utility and throughput of such an 
instrument when used according to the method of this invention. However, the use of fewer 
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markers per lane (i.e., tetranudeotide repeats) would substantially reduce the cost effectiveness 
of the hardware by reducing efficiency. 

Finally, far fewer of tri- and tetranucleotide markers have beer, fully characterized at 
present. Thus, the availability of well-chaiacterized primers which can be assembled into SETS 
5 and GROUPS remains another limiting factor at present. 
Construction of Marker SETS: 

The selection of markers for inclusion in each SET is based on the need to: maximize 
heterozygosity values (genetic informativeness), place the marker within a SET based on the size 
of the PGR products (alleles produced must not overlap with those of the marker above of below 

10 it), and the location of the marker in the genetic map (ideally we would have 450-500 markers 
placed 10 cM or less apart). The PCR products corresponding to markers within a SET are 
sized to assure that infrequent alleles and stutter bands do not produce overlap between the 
markers (compare e.g M Figures 4 and 6). PCR products for SETS of dinucleotide markers 
differ by approximately 9 nucleotides, preferably, at least 10 nucleotides, in length. When 

15 necessary, new oligonucleotide primers based on the unique sequence surrounding a polymorphic 
marker are designed and synthesized to assure that the PCR products do not overlap during 
eldwUophoresis 

Figures 7A-M show 289 SSR markers that have been selected and combined into 11 
GROUPS of 21-24 markers and 2 incomplete GROUPS of 16 markers so that markers in each 
20 GROUP can be separated and analyzed simultaneously. The selected markers cover the genetic 
map on average once every 10 cM. Most are heterozygous greater than 70% of the time. In 
a preferred embodiment, each SET is composed of 8 markers from multiple linkage groups (see, 
e.g M Figure 7B-H). Most preferably, SETS of markers are part of a single linkage group (i.e. 
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a single chromosome), but this may require significant additional labor because fewer existing 
primers will be suitable. 

Additional or alternative SSR loci to assemble into GROUPS of markers may be found 
in GDB. Loci listed in GDB can be arranged on the genetic map by using map location 
5 information in GDB. Additional or alternative primers may then be designed using information 
on the surrounding DNA sequence available in Genbank, based on the locus designations from 
GDB. GROUP 1 markers (Figure 7A) are currently perform ing well in multiple laboratories. 

In many cases new oligonucleotide primers must be designed from the sequence 
surrounding * a " h ««itVw to produce PCR products that fit between the products of the markers 

10 above and below it without overlap. The new primers can readily be designed from the known 
sequence surrounding the SSR. Criteria for selecting a sequence to be synthesized as a PGR 
primer are well known (see, e.g., Sambrook, et aL, and Innis, et .aL, especially p. 9). 
Pr efe r a bly, the unique primer 3' sequence should contain at least 7 nucleotides, the A G 
threshold should be at least -1.0 kcal/mol, most prefera bly -1.4 kcal/mol, and duplex formation 

12 should be avoided, the rarimnm length of duplex not exceeding 2 base pairs. The sequence 
of preferred primers win also minimi or eliminate sdf-complementarily, hairpin formation, 
and false priming. Once the sequences of candidate primers are chosen, synthesis is readily 
accomplished by standard methods (see, e.g., Sambrook, et aL). 
Optimization of PGR Conditions and Appearance on the Gel: 

20 These new primers must be tested to assure that they produce an easily scored collection 

of products of the correct size. Scoring may be easier if the label is on one primer rather than 
the other for particular markers (see, e.g., Figure 8). Primers developed for dinucleotide 
markers may p erfor m well in the PCR reaction, but produce products unacceptable for 



SUBSTfTUTE SHEET (RULE 26) 



WO 95/15400 



PCI7US94/13945 



-26- 



gcnotyping (single base stuttering bands, stuttering bands of equal intensity with true alkies, or 
stuttering bands that axe larger than the correct allele), and such primers should be avoided. 

For best results, the PCR conditions for each marker should be optimized to eliminate 
any artifactual PCR products due to nonspecific annealing that may complicate the analysis of 
5 a GROUP of combined markers. In particular, the temperature of the annealing phase of each 
PCR cycle should be optimized for each primer pair. Accordingly, the annealing phase 
temperature is set relatively high, so that specific hybridization occurs, but non-specific 
hybridization between the template DNA and the primers is minimized Usually, the selectivity 
provided by this optimization is preserved in the method of this invention by limiting the number 

10 of primer pain in any PCR reaction vessel to those whose optimized annealing temperature is 
the same or nearly the same. Preferably, all primer pain in the same PCR vessel have 
annealing temperatures within 4C of each other. At one extreme, an entire 96 well plate is 
dedicated to PCR reactions using primers for a single marker. (When genotyping is preformed 
for a large number of individuals, using a separate plate for PCR reactions for each marker will 

IS not reduce efficiency.) Alternatively, each PCR vessel on a plate has only one primer pair, but 
the plate contains vessels having different primer pairs, so long as all primer pairs on the same 
plate have annealing temperatures within 4C. In a preferred mode, all of the primer pain for 
a SET or even a GROUP are constructed to have optimized annealing temperatures in a narrow 
range, most preferably 4*C, and all of the primers are present in a single PCR reaction vessel, 

20 obviating the need to mix the individual PCR products prior to dectrqphorctic separation. 

In addition, each marker should be evaluated to assure it is sized correctly within the SET 
and that the alleles can be easily scored as distinct products. Furthermore, reported 
heterozygosity values are usually verified using a population of unrelated individuals. The same 
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DNA templates provided herein may be used as controls for verification of protocols and quality 
assurance. Preferred controls include CEFE parents (BIOS corporation, New Haven, Conn.; 
Cell Repository, Camden, NJ.), such as families 1331, 1347, 884, for which reference alleles 
are known (see, Weber, et aL, and Genethon Microsatellite Map Catalog, Genethon Human 
Genome Research Center, Eviy, France). Pooled DNA from volunteers who have donated 
blood that has been purified as described in the EXAMPLES may be used as weiL 

This optimization process requires the synthesis of oligonucleotide primers, dilution and 
aliquoting of primers, identification of the appr o pri ate annealing temperature (T) and PCR 
protocol, electrophoresis of the products, autoradiography and data analysis. If labelled primers 
are used for detection of products, 5' end labeling of both primers should be tested to determine 
which one produces the best image 1 . The size of the PCR products from each marker should 
be verified experimentally to assure that it does not overlap with the products of the surrounding 
markers in the same SET. As a control for this purpose, PCR products from a pool of DNA 
samples from a population of unrelated individuals may be electrophoresed against a DNA 
sequence ladder. In a piefni e d mode the test pool win contain at least SO chromosomes. 

Initial characterization of primers for each SSR marker may be p e rfor med with ^P labels 
because this is less costly, but the smooth adaptation of fluorescent-based techniques for 
genotyping with markers that have been optimized using *P is also dependent on assuring the 
PCR products labelled with a fluor esc ent dye perform as expected during PCR and analysis. 
Therefore, the reliability of the developed protocol should be checked by electrophoresis of 
DNA samples labelled by PCR with the fluorescent labels. 



Frequently the image produced by labeling one of the pair of primers is blurred, see, 
e.g., Figure 8. 
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The PCR products of different microatellite markers frequently vary significantly in 
intensity (see, e.g., Figure 9). The sizing of fluorescent PCR products of grossly different 
concentrations is potentially complicated by sample overloading, causing spectral interference 
between the dye labels during analysis. There was no int erferen ce in the detection of the 
5 overlapping products using the four dyes in Examples 1 or S, because the concentration of each 
PCR product was determined and adjusted to prevent overloading. However in our ccpcrimne 
this can become a problem when working routinely with 21 to 24 pooled markers. 

Overloading can lead to artifacts that become especially troublesome when they are 
interpreted as internal size standards. To prevent the inaccurate sizing of the products by the 

10 GeneScan 672 software, we have found that the selection of the standard peaks must be earned 
out manually. During large scale applications, such as in our linkage studies, this may become 
a serious problem. Moreover, it is often impractical to estimate the concentration of each of the 
fluorescent products in order to adjust the concentration of the individual samples to be pooled. 
Generally adjustments in the volumes for each marker can be made for all the samples by 

15 estimating the relative intensity of the marktr within a SET. This is easily accomplished by 
referring to the data table of fluorescent band intensities or by viewing the electrpphoretogram 
directly. 

In a preferred mode, PCR products are recovered and combined into a mixture containing 
the GROUP by a simple protocol that uses magnetic separation technology to purify the 
20 fluorescent PCR products and which restricts the total amount of product pooled to prevent 
overloading. Magnetic separation provides simple separations based on specific binding 
interactions without the need for expensive centrifuges. Saturation binding to a Wit™ 4 amount 
of paramagnetic beads can be used to control the amount of labelled PCR product carried 
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forward in the analysis. Relative intensity may be adjusted by this means and overloading may 
be avoided. 

In a prefer red embodiment, one primer is labelled with a component that will bind to 
magnetic microbeads, for example biotin-labelled primers will bind to streptavidin-coated 
magnetic beads* Methods for labelling primers with biotin are taught in, e.g., Innis, et aL, 
"PGR Protocols," 1990, pp. 100-103 and referen ces cited therein. Magnetic beads coated with 
streptavidin are commercially available (Dynabeads™) and p ro c edures for separation are 
described in, e.g., "Magnetic Separation Techniques Applied to Cdlutar and Molecular 
Biology," Kcmshcad, etaL, eds., Wordsmiths' Conference Publications, Somerset, U.K., 1991. 
A fixed amount of magnetic beads axe added to die PCR reaction after amplification using 
primers that will bind to die magnetic beads. Hie magnetic beads with the PCR product 
attached are separated from the remainder of the PCR reaction mixture, including salts and 
unused, detectably-labeUed primer, and then the PCR product is recovered from the magnetic 
beads (for example, by s e par ating the strands, leaving one strand attached to the bead and 
recovering the other strand whose primer carries the detectable label). 

Alternatively, the entire PCR product may be labelled by including biotinyiated DTP in 
the PCR reaction medium as described by Dennis, et aL, 1990, in "PCR Protocols," Innis, et 
aL, eds. The PCR product can be bound to the beads far purification from the PCR reaction 
mix and exces s primer, and subseq u ently recovered from the beads by, for example denanirarion 
of streptavidin. In another alternative mode, paramagnetic beads which have attache to their 
surfaces single stranded DNA cor r es p on ding to a part of the sequence of the PCR product may 
be added to the PCR reaction mix at the end of amplification, followed by cycling above the 
melting temperature, reanneaiing and then separating the paramagnetic beads and any other DNA 



SUBSTITUTE SHEET (RULE 26) 



WO 95/15400 



PCT/US94/13945 



-30- 



strands annealed to .the beads from the reaction mix. Labelled strands can then be recovered 
from the beads, as above. 

Selection of SETS and GROUPS of fluorescent SSR markers covering the human genome 
(approximately 300) can be completed in approximately 6*9 months, using the pr ocedures 
5 provided herein. Preferably, additional fluorescent markers will be developed (approximately 
500 SSR markers) providing a higher resolution tool for gene mapping. The resolution of this 
marker collection will approach 10 cM and will preferably cover the telomeres which will better 
assure linkage detection in complex non-Mendelian disorders like asthma and diabetes. 

The development of a common index set of fluorescent markers that can be used in 
10 multiple laboratories simultaneously should provide certain advantages in genomic studies. 
Typing these common index loci in a number of different populations afflicted with the same 
disorder will facilitate the comparison of linkage results and provide the information required 
for the eventual application of these techniques to forensic medicine. 

The method of this invention offers several significant advantages over a similar strategy 
15 adopted by Diehl et aL, 1991, Am. J. Hum. Genet., £Z:177. Spacing markers in a SET 
according to this invention avoids overlap, providing improved discrimination among markers 
and between markers and artifacts. As many as eight or more markers may be incorporated into 
a SET. When necessary, new oligonucleotide primers based on the unique sequence surrounding 
a polymorphic marker can be designed and synthesized as taught herein to assure that the PGR 
20 products do not overlap during electrophoresis. Errors introduced by sample handling may also 
be mtnimizwl by storing DNA from each individual to be studied in a 96-well format Our 
protocol preserves the integrity of a 96 well format including PCR amplifications, product 
pooling, and sample purification, thereby minimizing sample handling and errors introduced by 
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excessive sample manipulations. In a preferred mode, efficiency is further aided by the transfer 
of a row of samples by multichannel pipette. 

The combined analysis of multiple markers mayimires the use of the Applied Biosystems 
373 sequencer or titmiar automated analysis hardware. Since the capacity of the 373 sequencer 
is 36 lanes per gel, 864 genotypes (1728 alleles) can be analyzed routinely from one gel using 
the semi-automated method of this invention. A typical linkage study would include about 100 
famiiim or about 500 individuals. For a 5-year study including about 300 markers, 
approximately 180 gels, or about 3 gels per month, will be required. By using the method of 
this invention, at least 2 gels per day can be run per 373 sequencer. Thus, up to 12 
investigators can be accommodated on one instrument, which substantially reduces the cost per 
investigator. 

The method of this invention can also increase the efficiency of diagnostic studies of the 
genome, when the desired diagnostic pr oce du res involve the detection of genetic changes that 
affect the length of genomic DNA at 6 or more locations. Such changes include additions, 
deletions, intra-and interchromosomal crossover, gene amplification and similar gene 
rearrangements. The loci of many such rearrangements are known and associated with many 
diseases, especially cancers and metabolic er r ors inherited recessively. PCR using primer pairs 
which direct amplification of a DNA segment including one of these loci can be used 
diagnostically where the rearrangement associated with die disease causes a change in the length 
of the PCR product A SET of pdmers designed according to the principles above can be used 
in the production of PCR products that can be analyzed dectrophoretically in a single lane, for 
more efficient use of electrophoresis and analysis equipment. 
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EXAMPLES 

The following examples describe particular embodiments within the broader invention. 
These embodiments are described for illustrative purposes only, without intention to limit the 
invention. 
EXAMPLE 1 

As an initial test of the fluorescence technology, a study was conducted to compare me 
accuracy and efficiency of these methods with a conventional radiation-based method. Three 
microsatellite loci producing PCR products that overlap in size were chosen to compare the 
accuracy of genotyping by fluorescence versus radiolabding. Discrepancies between the 
genotypes derived from each technique were resolved by repetition. To estimate the variation 
in sizing of the fluorescence-based technique certain samples were loaded on 3 or more gels for 
comparison. DNA from CEPH (Centre d'Etade du Polymorphisme Humaine, Paris) family 
884, 1331, 1332, 1333, 1362 were amplified fbrMarshfield markers, mfd 1 (176-196bp), mfd 
59 (175-195bp), and mfd 154 (186-204bp) using the polymerase chain reaction (PCR). 

Fluorescent techniques'. The forward and reverse primers were each labelled at the 5* 
end for detection by autoradiography with ("P] r ATP(6000 G//unole) using polynucleotide 
kinase. A primer was select ed from each marker for fluorescent labeling on the basis of the 
image of the products (see Figure 8). The optimal annealing temperature was selected for each 
marker enipirically by selecting a temperature that diminated nonspecific annealing or artiractual 
(background) PCR products. Fluorescent labels were attached at the 5' end via phosphoramidate 
derhdti2ationusmgAimnolink2(ApplW Primer B (see Figure 10) for mfd 1 was 

labelled yellow (TMR), primer A (see Figure 10) for mfd 59 was labelled blue (FAM), and 
primer B (see Figure 10) for mfd 154 was labelled green (TOE). PCR conditions were: 0.4 uM 
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primers, 1.2 pM Mgd,, 50 pM Kcl, 200 pM dNTPs and 0.S units polymerase (final con- 
centrations); 94*C for 10 rain; followed immediately by 30 cycles of 94°C for 30 sec; S8*C 
(mfd 59, mfd 154) for 30 sec or 60°C (mfd 1) for 30 sec; and 72 °C for 30 sec; followed by 
72*C for 7 min. PCR was earned out in a volume of 12.5 fd using 25 ng of CEPH DNA. 
CEPH DNA was stored in a 96 well microtiter plate (Peridn Elmer/Cetus). Amplifications were 
performed in 96 well microtiter plates using a Peridn Hxner/Cetus Model 9600 thermalcycler 
and accessories, maintaining the integrity of the 96 well template. Five microliters were 
combined from each marker for each CEPH individual using a multichannel pipett e 
(Transferpette-8, Brinkman). The pooled PCR products were desalted by adding 2 volumes of 
sterile deionized distilled water (ddHjO), ice cold ethanol (100%) equal to the total volume, and 
dulling for 30 minutes at -70*C. The microtiter plate was spun at 4'C at 1400XG for 2 hours 
in a Bfrinna n Model GS6R centrifuge. The yp^-Tumm was aspirated, the pellet was washed 
once with 1.5 volumes of ice cold ethanol (70%), and the plate centrifuged 30 minutes at 
1400XG at 4'C. The supernatant was aspirated and the plate was air dried. Pellets were 
resuspended in a volume of sterile ddHjO equal to the starting volume (pool). 

Radiolabelled products were separated by conventional electrophoresis and scored 
manually from autoradiographs. Fluorescent PCR products were separated on a 373 sequencer 
with internal size standards in each lane (GeneScan 2500-ROX; Applied Biosystem) and analyzed 
using GeneScan" 1 672 software (Applied Biosystems). Each sample (representing 0.5 pi of each 
product) was heated to 99'C after adding 1 pi of the internal lane size standards (GeneScan 
2500-ROX, Applied Biosystems) and 2 pi formamide/EDTA loading buffer, until the total 
volume was reduced to 2-3 fd. Hectrophoresu was carried out using 6% aoyiamide (Biorad), 
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8 M urea (Ultxapuie, USB) gels in 1 X TBE. The reduced volume was loaded and run for 4-8 
hours on a model 373 Sequencer (Applied Biosystems) using a 24 cm well to read distance. 

The size of the PCR product is determined by reference to the internal lane size standards 
(Caxxano et aL 1989, Genomics, 4:129-136). The size standard ROX-2500 (Applied 
5 Biosystems) including fragments: 37, 94, 109 9 116, 172, 186, 222, 233, 238, 269, 286, 361, 
and 479 nucleotides in length was used with modifications. PCR fragments 61 and 68 
nucleotides in length were gel purified, labelled by aminoiinking with ROX, and added in equal 
volumes to the ROX-2500 standards. These fragments were added because desalting by ethanol 
precipitation recovers the unused PGR primers with the products. The intense peak produced 

10 by the unincorporated labelled primer is seen in the standards because of interference between 
dyes and obscures the detection of the 37 nucleotide standard fragment, Therefore, we have 
modified the GeneScan-2500 standards to provide a fragment of known size labelled with ROX 
to accurately estimate the length of the smallest alleles. 

The GeneScan 672 (version 1.0) software recognizes any peak labelled with ROX, 

IS computes a cali br ati o n curve based on a second-order least-squares fit, and uses these data to 
estimate the allele sizes of the PCR products (Ziegle et aL 1992). Data from each lane can be 
analyzed independently, or four lanes of data for a single fluorescent dye can be displayed 
simultaneously to compare individuals within a family. Allele sizes in nucleotide bases, the 
genotypes, are assigned by interactively distinguishing major peaks from background artifacts. 

20 The scale on the display can be adjusted to analyze peaks with differences in fluorescent 

intensity. The intensity of each fluorescent band and peak areas provide an objective method 

of distinguishing alleles from arti&ct (including stuttering bands). Allele sizes can be transferred 

to a spreadsheet database for linkage or a multicolor electrophoretogiam. 
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mfd 1, mfd 59, and mfd 154 PCR products overlap in size (175*204) bp (see Figure 10). 
There was no evidence of interference between the dyes even when there was complete overlap 
during the electrophoresis of PCR products, similar to that reported by Ziegel et aL, 1992. In 
our experience, in terferen ce between dyes does become a problem with overloaded samples. 
A comparison of the genotyping results of the radioactive and fluorescent labeling methods 
revealed 4 discrepancies out of 462 possible comparisons (alleles) (see Table 1). One 
transcription error occurred in the manual data manipulation of the fiuorescently iah»HM 
products. There was no i nte r feren ce bet w een fluoropharcs with the detection of the overlapping 
products using the four dyes. No sizing errors were attributed to the fluorescence-based 
technique and each mariner displayed Menrtdian inheritance. The average size variation across 
all comparisons was 0.28 nucleotides. However, the maximum difference (range) found for any 
of the 462 comparisons was 0.47 nucleotides (see Table 2). Generally sizing varied less within 
a gel than between gels. The variation in the size of the alleles was ****** when comparing 
each of the individual markers. The remaining discrepancies occurred with the use of the 
standard radioactive-based protocol and represented an error rate of less than 1%. Inaccurately 
sized PCR products and s ampl e misleading* produced mistypings with the conventional 
technique (see Table 1). In general, fluorescent internal size standards provided more precise 
sizing than did radiolabeling. These data demonstrate both improved accuracy and efficiency 
for typing SSR mark e r s with use of fluorescence-based techniques. 
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TABLE1 



CEPfl 
DNA/Maiker 


Genotype 
Radiolabdled 


Genotype 
Fluorescence 


Explanation g 


884-18/mfd 1 


178,192 


178,194* 




1331-16/mfd 59 


179,179 


179,185* 


gel loading error fl 


1331-17/mfd 59 


179,170 


179,185* 


gel loading error fl 


61332-15/mfd 154 


185,200* 


200,200 


recording error ' | 



10 * indicates c o r re ct score by length in nucleotide residues 

TABLE 2 

15 



j COMPARISON 


1 RANGE 


(in nucleotides) | 




Maximum 


Average 


Standard Deviation | 


1 integer* 71 


0.47 


0.28 


.08 | 


| intragel 571 


0.42 


0.18 


.07 | 


mfd l* 7 


0.35 


0.19 


0.1 I 


mfd59 m 


0.37 


0.15 


.08 I 


mfd 154" 7 


0.42 


0.23 


.06 I 











25 Superscripts indicate number of samples 
EXAMPLE 2 

Mapping with Fluorescent Primers 

Genomic DNA is isolated as described by MJ. Johns, et al. f Analytical Biochem., 
30 120:276-278 (1989). 

To minimize sample handling, DNA templates can be stored in a 96 well grid (e.g., 
Perltin Hmer/Cetus). The integrity of the grid may be maintained throughout the protocol to 
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avoid errors introduced by manual pipetting and sample handling. Multichannel pipetting from 
a 96-well grid expedites sample handling while minimizing human errors . 

PGR is performed in a reaction volume of 12.5 /d, containing 50pM dATP, dGTP, 
dTIP, dCTP; 0«07^M of the labelled oligonucleotide primer, and 4 /cM of the unlabeled 
primer. Taq polymerase (Perlrin-Elmer\Cctus) 0.5 units is added on ice. PCR will usually be 
performed in a thennalcycler, e.g., a Perkin-Elmer\Cetus 9600 thermalcyder. Standard 
thermalycyder settings axe 94 # C for 10 minutes, followed by 30 cycles 94°C for 30 seconds, 
30 seconds at avenge annealing temperature for the primers and 72*C for 30 seconds; final 
extension is at 72*C far 7 minutes. 

Labelled PCR products are purified by co-precipitation in EtOEL 24 markers may be co- 
pre cipit a t ed simultaneously in the 96-well f orma t using ethanoL Ethanol precipitation desalts 
the products but eppurifies the primers. Hie labelled primer peak produces an enormous signal 
that complicates the analysis of products under 93 nucleotides in length because it interferes with 
the 37 nuc l e o ti d e ROX GeneScan-2500 standard. As an alternative, internal standards may 
incorporate fragments that are SO, 60, and/or 70 nucleotides in length in addition to the 
GeneScan 2500 standard fragments or an equivalent set of fragments. 

The amplified products are analyzed by denaturing gd electrophoresis (Sambrook, et aL). 
Loading buffer (2X concentration) is added to an equal volume of die PCR reaction, and the 
PCR reaction is loaded on a 6 % polyacrylamide geL Radioactive products will be sized against 
a sequence ladder, the gels are dried and then exposed to Kodak XAR film for 4-24 hours with 
or without intensifying screens. Fluorescent labelled PCR products may alternatively be 
analyzed by semi-automated detection using, e.g. f an ABI 373A automated sequencers and 
GeneScan 672 software from Applied Biosystems, Inc. 
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EXAMPLE 3 

PGR products are produced as in Example 2 and then purified and combined for 
electrophoresis using a magnetic bead protocol in place of EtOH precipitation. One of each pair 
of primers is labelled with biotin and the other with a fluorescent label as above. Double 
stranded PCR products are purified using streptavidin conjugated to paramagnetic beads to bind 
the primer 5* labelled with biotin. This procedure may be easily adapted to the 96-weil format 
in any laboratory without expensive centrifuges. After the DNA bound to magnetic beads is 
separated from the PCR reaction tp^**, the two strands are melted and separated, and the strand 
labelled with the fluorescent primer is pooled with other labelled strands of its GROUP for 
electrophoresis. The result of increasing the amount of beads used for separation of a single 
PCR product from its PCR reaction mix is shown in Figure 12. 
EXAMPLE 4 

*P OPTIMIZATION OF PRIMER SETS 
DNA Templates 

Q .PH parents and/or unrelated volunteers as controls may be tested. In addition, we 
usually include one "no DNA" control and one r e feren ce individual (alleles known) on each 
plate. To mairimiTft the use of resources, each marker may be optimized, using 12 wells or less 
of a 96- well plate. Eight markers are amplified per plate at a single t emper ature. Alternatively, 
a thermalcycler with a tmallfr sample capacity may be used. 

The 5' end of the primers to be tested is labelled with *P using the polynucleotide kinase 
reaction. Mix 5p sterile ddHjO, 2.8 fd 5x kinase buffer (250 mM Tris, 50 mM MgQ 2 , 50 mM 
DTT, 0.25 mg/ml BSA), 6.0 id 10 fdA primer, 0.8 fii T 4 polynucleotide kinase, and 3.0 fd t^P 
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ATP (6000 Ci/mmol). Incubate at 37» for 1 hour, then add 26 mI sterile ddHA spw through 
select D column (Five Prime Three Prime) loaded with P4 Biolgel (BIORAD) according to the 
manufactnrers recommendations. The labelled primers may be stored at -20 - C. 

For optimization, set op simultaneous PCR reactions as described in Example 2, using 
DNA templates described above (e.g M 2 CEPH (133M, 1347-02), 1 pooled sample (50 
ctomosomesJ.lnoDNA). Perform PCR at the annealing tennjerarure (T^ ca^ 



follows 



T* - 2(A+T) + (Q+Q (Vthcala^ttmpm^fyrlpmn^g^y, for 
example 54* and 64*, begin closer to lower 

Check the amplified PCR product for artifact by electrophoresis on 6% gel. Continue 
optimization of the selected ^-labelled primer with control individuals, mcreasing the annealing 
temperature in 2' increments until nonspecific products arc eliminated. On average, 
determination *aj^ 

When all markers from a SET are optimized (usually 8 marVers), 3 fd from a pool of 
PCR product of DNA from unrelated individuals using primers for each marker in the SET is 
combined with an equalvtuume of h^^caar**^ Seven sd (or n«ximum 
^volun*)ofthecon^ This last cheek 

on size and product assures n^at the mate 

nucle^apart. Tneprimer sequences may then be use* to svnthesize fiuor^ 
products. 

EXAMPLE 5 

A protocol extending this approach to include up to 24 microsateUite mate in each 

dectrophoretic lane was tested as follows The «.t~ti M . 

as rouows. me selection of markers was based on the need to: 
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maxixnize heterozygosity (genetic infonnativeness), distribute markers across the entire genetic 
map, and the placement of the marker within a SET based on the known size of the PCR 
products (alleles and stuttering bands produced must not overlap with those of the marker above 
of below it). 

Highly informative microsatellite markers were assembled into a ladder or 'SET* . Each 
marker in a SET is spaced a distance of at least 9 nucleotides from surrounding markm such 
that none of the PCR products overlap in size when separated on a 6% denaturing aoylamide 
geL Since many dinucleotide repeats produce a complex pattern of 3 or more stutter bands, this 
spacing is critical to assure that more intense stutter bands from an upper marker, will not be 
misinterpreted as a product from a lower marker. In addition, new alkies both larger and 
smaller than the reported product sizes for this type of marker have occasionally been 
discovered. Each SET was lafrrHH with one of three different commercially available 
fluorophores (TMR, FAM, and JOE; Applied Biosystems). The fourth fluorophore (ROX) was 
reserved for the internal size standard. Three SETS each labelled with a different fluorophore 
were pooled into a collection of markers we have termed a "GROUP*. 

New primers were designed as necessary using OUGO 4.0 (Research Genetics, 
Huntsville, AL) to fit within the marker ladder. Each GROUP was constructed to avoid overlap 
between markers within SETS but to allow overlap between SETS. 

The autoradiographic image pmducerf by many maito* varigri Hf^^ng <m whether the 

forward or reverse primer was labelled (see Figure 8). Therefore, both primers from each 
marker were evaluated for image clarity and the ability to distinguish the most intense produces) 
or alleles. The appropriate primer was then selected for further use. Optimization of the PCR 
conditions for each marker was also accomplished using radiolabeHng. The strategy of 
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developing a ladder of markers warranted that the conditions for PCR eliminate nonspecific 
annealing and background bands. When nonspecific annealing could not be eliminated by raising 
the annealing temperature, anew marker was chosen for use. Thus uniform PCR conditions as 
described in Example 1 were used for all the markers chosen except that the »m,~«» c 
temperature was specific to each marker. GROUPS 1 and 2 have 6 and 9 different annealing 
temperatures, respectively (see Figures 7A and B). An entire rmcrotiter plate containing DNA 
from a number of different individuals will usually be amplified for a given marker at one 
temperature at a time, so this should not reduce the overall efficiency of the protocol. For 
studies wim fewer sainples 

Variability among tnermalcyder operating temperatures may require adjusting the 
annealing terrrperature when switching from one machine to another. Therefore me use of the 
protocols described for marker GROUPS 1 and 2 should be preceded by a Revaluation of me 
suggested annealing teinperamres for optimal perfcrmance. This can generally be carried out 
once on a few markers and when necessary the annealing temperatures can be adjusted up or 
down for all the markers for that machine. 

The intensity of the products varied considerably from marker to marker. Whenmarken 
were radiolabelled and a SET was run on the same gel, detecting all of the products on the gel 
with a single film exposure was often impossible. Attempts to score on a single gel the larger 
products in each SET using radic4Ctfve-based techniques were unsuccessful. Although gradient 
gels improved the band spacing, a maximum of 4-5 markers could be resolved per gel on 
autoradiography An autoradiograph of GROUP 2 SET B is shown in Figure 9. The range of 
intensity in the products of this SET is typical of this type of marker and multiple 
aumradiographs are required for genotyping. These problems are partially overcome by the use 
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of fluorescent labels (Ziegle et al., 1992). Fluorescent signal detection is linear over a greater 
range, so that the markers with the weakest product intensity are more readily typed in real-time 
along with the most intense products from other markers. 

Marker GROUPS 1 and 2 arc described in Figures 7A and B, respectively. Hie primers 
sequence, chromosomal location, choice of labelled primer, and optimal amwaKng temperature 
is listed for each locus. GROUP 1 is composed of a combination of 21 di-, tri-, and 
tetranucleotide markers from multiple linkage groups. The product sizes range from 66 to 322 
nucleotides. Group 2 is composed of 24 ^nucleotide markers with products ™»gi"e in size 
from 75 to 349 nucleotides. The mean heterozygosity for both GROUPS is 74%. 

Scoring of the fluorescent products using the AM 373 sequencer and GeneScan 672 
software was unambiguous in samples that were desalted by ethanol precipitation. Desalting was 
carried out as Mows: 5 fil of each PCR product from the same SET (like cote) was combined. 
Then 1.0 fd per marker per SET was combined for each of the 3 SETS giving a final volume 
equal to the total number of markers in the GROUP. Sample handling was otherwise exactly 
as described above for the individual fluorescent markers. 

A typical set of electrophoretograms of each SET from GROUP 2 for a single individual 
is illustrated in Figure 5. Each of the alleles can be easily recogmzed by the unioue signamre 
of tr* stuttering bands for these dinucleotide repeat markers amplified by PCR. Samplesthat 
were not desalted were difficult to score because the mobilities of the products and the 
ROX-2500 internal lane standards were altered. Salt and primer loads become a problem when 
wmbining multiple products for electrophoresis because the necessary volume reduction results 
in sample concentration. The salt concentration rises with the product concentration and 
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interferes with the separation of the products and standards. This becomes critical when pooling 
21 to 24 markers. 

It will be understood that while the invention has been described in conjunction with 
specific embodiments thereof, the foregoing description and examples art intended to illustrate, 
5 but not limit the scope of the invention. Other aspects, advantages and modifications will be 
apparent to those skilled in the art to which the invention pertains, and these aspects and 
modifications are within the scope of the invention, which is limited only by the appended 
claims. 
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CLAIMS: 

1. A kit for use in automated gcaotyping within a population comprising at least 4 
GROUPS of at least three SETS n ri i comprising labelled pain of primers for amplification of 
DNA by polymerase chain reaction (PCR), 

each primer pair having unique sequence found in the flanking sequences of a 
microsateilite sequence comprising a nucleotide repeat sequence flanked by unique sequences, 
such that a polymerase chain reaction (PCR) primed with the primer pair amplifies the 
nucleotide repeat sequence and at least some immediately adjacent unique sequences of the 
microsateilite sequence to produce a PCR product identified with the primer pair, wherein the 
microsateilite sequences axe nucleotide repeat sequences thai are polymorphic within , the 
population, 

each SET consisting of at least 6 primer pairs, each primer having the sequence 
of unique sequences respectively flanking at least 6 microsateilite sequences in the genome, such 
that the length of the segment amplified by a particular primer pair differs from the length of 
all other segments in the SET by at least 5 nucleotides, and at least one primer of each primer 
pair is labelled with a fluorescent label that is the same fluorescent label for all primer pairs in 
the SET, 

each GROUP consisting of at least three SETS of primer pairs labelled with 
fluorescent labels, wherein the wavelength at which the respective fluorescent labels fluoresce 
is substantially different for the labelled primers in each of the respective SETS, 

wherein the distance in the genome between one microsateilite sequence amplified 
by a primer pair of the lot and the nearest other microsateilite sequence amplified by another 
primer pair of the kit is at least 2 centimorgans (cM) and no more than 50 cM. 



SUBSI]FJ7ES^£T{RWI26) 



WO 95/15400 



PCT<X'S94/13945 



-45 - 



2. The kit of claim 1, wherein the PCR products identified with any primer pair 
amplifying microsatellite sequences containing dinucleotide repeats differ in length from PCR 
products i d e ntified with all other primer pain of the same SET by at least 9 nucleotides. 

3. The kit of claim 1, wherein one of said GROUPS consists of the three SETs of 
Figure 7A. 

4. The lot of claim 1, wherein one of said GROUPS consists of the three SETs of 
Figure 7B. 

5. The kit of claim 1, containing the 6 SETs shown in Figures 7A and 7B. 

6. A method of analyzing genomic DNA for the presence of polymorphisms 
comprising 

a) extracting DNA from a human sample; 

b) coiribining, in a polymerase chain reaction (PCR) vessel, an aliquot of said 
DNA from a human sample, at least one primer pair selected from a GROUP in the kit of rf«m 
1, and PCR amplification enzymes; 

c) cycling the temperature of each PCR vessel so that PCR products identified 
with said at least one primer pair are produced by PCR amplification of segments from said 
DNA from a human sample, each vessel being cycled at an annealing temperature wherein non- 
specific annealing of the primers to said DNA from a human sample is minimized; 

d) then combining all PCR products from all PCR vessels containing primer 
pairs from one GROUP into a mixture, and subsequently separating the mixture of PCR products 
electrophoredcaUy by 

e) detecting separated PCR products by fluorescence aetection at wavelengths 
awesjxMiding to the fluorescent wavelength for each of the fluorescent labels in the kit 



c!ia«Trrsrn: ewrrr /©in r oc\ 



WO 95/15400 



PC77US94/1394S 



-46- 



7. The method of claim 6, wherein the step of combining amplified DNA further 
comprises: 

i) contacting each vessel with a plurality of paramagnetic beads carrying on 
the surface a protein which specifically binds biotin, further wherein one primer of each primer 
pair is labelled with a fluorescent label and the other with biotin, for a period sufficient for said 
protein to bind biotin; 

ii) separating the magnetic beads from the PGR reaction medium; 

iii) separating the two strands of the amplified DNA segments and combining 
the strands labelled with a fluorescent label for all primer pain from one GROUP into a 
mixture* 

8. The method of claim 6, wherein the step of combining amplified DNA from the 
PCR vessels further comprises: 

i) contacting each vessel with a plurality of magnetic beads carrying DNA 
complementary to the sequence of one primer of the primer pair in the vessel for a period 
su ffici e n t to allow annealing between the primer and the DNA on the magnetic beads; 

ii) separating the magnetic beads from the PCR reaction medium; and 

iii) eluting the PCR product from the magnetic beads. 

9. The method of claim 6, wherein each primer pair of said kit is a rid f d to a 
different PCR vessel in step (b), such that the annealing temperature for temperature cycling in 
step (c) is the temperature wherein non-specific annealing of the unique primer pair is mimimrgH 
and PCR product from all PCR vessels containing at least one primer pair from the same 
GROUP are combined in a single mixture before electruphoretic separation. 
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10. A method for selecting a SET of PCR primers for use in automated genotyping 
comprising 

selecting at least 6 microsatellite sequences in the human genome, wherein the 
micxosatellite sequences are selected from dinucieotide, trinucleotide and tetxanudeotide repeat 
sequences that axe flanked by unique sequences, said microsatellite sequences being separated 
from each other by at least 2 centimorgans in the genome and being polymorphic within the 
population; 

constructing primer pairs for each microsatellite sequence, said primers having 
the sequence of the unique sequences flanking the microsatellite sequences, such mat the length 
of all polymorphs of the DNA segment amplified by a particular primer pair is detectably 
different from the length of all polymorphs of other segments amplified by primers in the SET. 

11. A kit for use in automated genotyping comprising at least 4 GROUPS of at least 
3 SETS of PCR primers obtained by the method of claim 10. 

12. The kit of claim 1 1 , wherein at least one primer of each primer pair in the SET 
is labelled with a fluorescent label that is the same fluorescent label for all primer pain in me 
SET. 

13. The kit of claim 11, wherein the length of all polymorphs of the DNA segment 
amplified by any primer pair amplifying rnicrosatcllite sequences containing dimicleotide repeats 
differs in length from the DNA segment amplified by all other primer pairs of the same SET by 
at least 9 nucleotides. 

14. A method of analyzing genomic DNA for the presence of polymorphisms 

comprising 

a) extracting DNA from a human sample; 
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b) combining, in a polymerase chain reaction n>™ , 

c) <0«% the ttmpeaa,. of each PGR ,. 

« ^ vessel so thai prp «^ — 

5 «sentially of amplified DVa PTOducts 

<0 xpacunt etocwphonaian, bv <i~ . • 

' ^P*™* JaeeaWy labelled Pes ™^ 

thcmbyieog*. PCE P™* 1 ™ ml characterizing 

I5 - netted of claim 14 wherein a, • 
prDdKO ""P^ fc» aid DNA ' . ° """^ k Cd) conntatag ^ pQj 
obtai^by: *" * taM " M * * "XP^pai, of *d ^ „ 

^•obi^biodn; . * , " - * , *"* ,, **»»— 

ft) seoarati reacCon medium; 

*e stands labelled ^ . /^"^^ ^ ^ segmenu and confining 

^th a fluoreseem labd for aJl priae, ^ , 
mixture. ^ 6x301 one GROUP into a 
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16. Hie method of claim 14, wherein the mixture in step (d) containing all PCR 
products amplified from said DNA from a human sample by any primer pair of said SET is 
obtained by: 

0 contacting each vessel with a plurality of magnetic beads carrying DNA 
complementary to the sequence of one primer of the primer pair in the vessel for a period 
sufficient to allow annealing between the primer and the DNA on the magnetic beads; 

ii) separating the magnetic beads from the PCR reaction medium; and 

iii) eluting the PCR product from the magnetic beads, 

17. A kit for analysis by polymerase chain reaction (PCR) of a genomic region 
containing at least 6 known loci at which genetic rearrangement is Hiap.^ f or a 
comprising at least one SET containing at least 6 PCR primer pairs, 

each primer pair having the sequence of unique sequences flanking one of said 
at least 6 loci of genomic rearrangement, such that a polymerase chain reaction (PCR) primed 
with the primer pair amplifies the DNA segment surrounding the locus of rearrangement to 
produce a PCR product of characteristic length, wherein the length of the PCR product is 
associated with specific diagnostic information, and wherein the length of the PCR product 
airtplified by a particular pair of primers differs from the length of all other PCR products 
amplified by other j>rimers in the SET and the PCR products for all primer pairs in the SET are 
detectably labelled with the same label. 

18. A diagnostic method for detection by polymerase chain reaction (PCR) of genomic 
rearrangement in a genomic region containing at least 6 known loci at which genetic 
rearrangement is diagnostic for a disease, comprising 

(a) extracting DNA from a human sample; 
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(b) combining, in a polymerase chain reaction (PCR) vessel, an aliquot of said 
DNA from a human sample, at least one pair of amplification primers selected from a SET of 
at least 6 primer pairs, and PCR amplification enzymes, each primer pair of said SET having 
the sfqucncff of unique sequences flanking one of said at least 6 loci of genomic rearrangement, 
such that a polymerase chain reaction (PCR) primed with the primer pair amplifies the DNA 
segment surrounding the locus of rearrangement to produce a PCR product of characteristic 
length, wherein change in the length of the PCR product is a ssociated with rearrangement at the 
locus of rearrangement, and wherein the length of PCR products amplified by a particular pair 
of primers differs from the length of all other PCR products amplified by other primers in the 
SET; 

c) cycling the temperature of each PCR vessel so that PCR products 
consisting essentially of amplified DNA segments labelled with detectable labels are produced 
by PCR amplification and the PCR products for all primer pairs in the SET are detectably 
labelled with the same labd, each vessel being cycled at an annealing temperature wherein non- 
specific annealing is minimized; 

d) separating electropharetically by size a mixture containing all PCR 
products amplified from said DNA from a human sample by any primer pair of said SET; 

e) det ect i n g separated detectably labelled PCR products and characterizing 
them by length. 

19. The method of claim 14, wherein each primer pair of said SET is added to a 
different PCR vessel in step (b), such that the annealing temperature for temperature cycling in 
step (c) is the temperature wherein non-specific annealing of the unique primer pair is minimized 
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and PCR product from all PCR vessels containing at least one primer pair from said SET 
combined in a single mixture before electrophoretic separation. 
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FIG. 1 
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FIG. 2 
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FIG. 4 
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FIG. 7A-I 
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F/G. 7A-2 

GROUP 1 



A Primer 



B Primer 



S'-GTC AGC ACC CCA ACC AGC CT-3 • 
5VTCC AGC CTC GGA GAC AGA AT-3 ' 
5*-GTTAGC ATA ATC CCC TCA AG-3' 
S'-AAG AAC CAT GCG ATA CGA CT-3 • 
5 # -CAT AGC GAG ACT CCA TCT CC-3* 
S'-CAG AAA ATT CTC TCT GGC TA-3' 
5'-AGC TAT CAT CAC CCT ATA AAA W 



5*-ACC GAA GAC CCC TCC TCT GG-3' 
5 '-ACT CCT TTC TCC AGA GCA GGT-3* 
S'-CGA TGG AGT TTA TCT TGA GA-3 • 
5'-CATTCC TAG ATG GGT AAA GC-3' 
5XX3G AGA GGG CAA AGA TCT AT-3* 
5'-CTC ATG TTC CTG GCA AGA AT-3 ' 
5'-AGT TTA ACC ATG TCT CTC CCG-3 * 



5'-CTG TTA TGG GAC TTT TCT CA-3* 
5'-ATC ACT TCC CCA CTT TTT AW 
5'-ACT TTG AAA ACC ACT GGC CT-3 * 
S'-ACC TAT AAT TGC ATC ATT GCA-3' 
5*- ATC TCT GTT CCC TCC CTC TT-3' 
5'-AAG CTT GTA TCT TTC TCA GG-3' 
5'-GTA TTT TTG GTA TGC TTG TCW 



5'-AAT GTA TGA AGT GGT ATG AT-3* 
S'-GCT GAG ATG GGA GGA TTC CT-3* 
5*-ATC TAT CTA GCC ATG GTA GW 
5 f -TOG TCT ATA ACT GGT CTA TC-3 • 
5'-CTT ATT GGC CTT GAA GGT AG-3 * 
5*-ATC TAC CTT GGC TCT CAT TG-3 * 
?-CTA TTT TGG AAT ATA TCT GCC T-3 * 



5'-AAT CTT CTT TTT TCT CTA TGA-3* 
5*.GTO CCA TIT TAC AGT CTC CT-3' 
5'-GCT AGC CAG CTC GTC TTA TT-3* 
5--GAG AGG GAG GGC CTG CGT TC-3' 
5 f -TTA AAA TCT TCA AGG CAT CTT W 
S'-TTC TCA TAT CAA AAC CTG GW 
5'-AAA AGT GTC TTA CTT TCA GAA C-3' 

SUBSTITUTE 



5'-CGT TTC ACT CCG TCT GTT TGA-3 ' 
5 , -TTT CCA TTC TCT GTC CGT TT-3' 
5'*ACC ACT CTC GGA GAA GGG TA-3 • 
5*-CAC CCA GGG CCA GAT AAA GA-3 ' 
S'-TTT GAG TAG GTC GCA TCT CA-3 ' 
5'-AAG GAT ATT GTC CTC AGG A-3 ' 
S'-ACA AGG TGA CAA GGT GCC TA-3 * 
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FIG. 7A-3 
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FIG. 7B-I 

Maricer Alleles (bp) Heterozygosity Chromosome 
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FIG. 7B-2 

GROUP 2 



A Primer 



B Primer 



5'-AAA CCC AAA CCC AGA GGA TM* 
f-GGC ATG TCA TTT TCG TAA GC-3* 
y-AAT ATG GCT ACA GCA TTG GA-3 ' 
5*-GAG CGA GAG CAA AAT GAG CC-3' 
5**ATA TGG AAA CTC TCC GTA CT-3' 
f'-AGT TAC ACC GGT TCT GCA GA-3 ' 
5'-ACT GCC TCA TCC AGT TTC AG-3 • 
S'-TCC TGG CTt TAA ACT TCA CAC AC-3 * 



5 f -AGG TGG GTG GAT AAC TTG AG-3* 
5'-GTG GGC CAC ATT AGG AAC AG-3* 
5**TGG GCG ATT TGT TCA TTG TG-3' 
5*-TGG AAG GAC GGG AAA TAA TA-3' 
5*-GCA ACC ATG GAG AGT CTG GA-3' 
5'-GAT TAA TGA TAG TGC TAT CC-3' 
5'-GAG CAG GCA CTT GIT AGA TG-3' 
5'-GGA ATA TGT TTT TAT TAG CTT GT-3' 



5*-GAA CAG AAC AGT GGA GCA TC-3' 
5»-TAG GAG GCA GAG GAT GGT TC-3* 
5*-CCC CAC TCT TAG CCA TTG TA-3' 
S'-TGG AGA TGT CCC ATA GAG GT-3 • 
5*-TTC AAG TGG TTG CCT CTG GC-3' 
5*-ATG CTT TAT CCA GAG AAA AG-3* 
5*-CAAACT TTC CAC AGT ATC GTT C-3' 
5'-CCA AAT GCT GGA GAC AGA GAG AA-3* 



5 f 43GC ATA CGA GAA AAT ACT GT-3' 
5--CAC CAG CCC CAT TCC TTA GC-3 ' 
5*-GAG ACA CAG AGC AAA TAG GT-3' 
S'-TCA GGA AAA CTG CCT GAG G-3' 
5-AGC / iC TTG CCC AGG CTA TGA-3 • 
S'-CAT CAT TAA TTG GAT TGT GG-3* 
S'-GTTTCC TTG AGA AGA ATG GAG C-3* 
5*-ACC CCT CCC TCC CTC CAT CAC AC-3* 



5VTTC TCA CAA AGT CAC CAC AT-3' 
S'-GGC CTC CTG GAA TAA TTC TC-3* 
S'-CTT GTT CAT CTG CCT TGT GC-3 f 
5'-ATC AAT GGA AAA ATG GGT AA-3 ' 
5'-ACT GGG GAA CAT GGT GGG GT-3' 
5VTITATG CGA GCG TAT GGA TA-3* 
S'-TCC TCA AAA TGA AGA ACA CA-3' 
S'-CCT GGA AAA ATG GCT CAC C-3* 



5 f -TAG GGA AAA TGA CAG GAA AA-3' 
S'-CAT TTT AAT GAA CAC CGC TC-3 * 
5*-ACC TAA GCG ACT GCC TAA AC-3' 
5VTAT CTT TCT CTC TCT GCC TT-3* 
5'- ATG ATG ATT GCC AAA GGG AA-3' 
5'-CAC CAC CAT TGA TCT GGA AG-3 • 
5*-AAA AGT CTA GTG TTG AGT GT-3 w 
S'-GGA AAA TCa UTC TCT AGT TC-3 ' 
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FIG. 7B-3 
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FIG. 7C-I 



Marker Alleles (bp) Heterozygosity Chromosome 
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FIG. 7C-2 

GROUP 3 

A p ™*r B Primer 

S'-CAG GAA ACT GGA TGT GAC CA-3' 
5'-AGC TCC GCT CCC TGT AAT-3' 
5'-CAA GGT TTC ACC ACA GTT CT-3' 
S'-AAG GCA GGC TTG AAT TAC AG-3 ' 
5'-CTC AAA ATG ACT GAT GGG GT-3' 
5'-GCT CCA GCG TCA TGG ACT-3' 
5'-GAG CAA GCA TCC AAA AAC GA-3' 
5*-GCT CAC TTC ACA TTC GTG G-3* 



5'-GAC TTC ACC ATC AAC GCC TC-3' 
5 f -GAG CAG CAC CGT ACA AAT-3* 
5 f *TAA CAT GAG CGA ATG GAC AA-3 • 
S'-GCC CAG GAG GTT, GAG G-3* 
5'-GGT ATG GAA GTC ACC CAA CA-3 ' 
5'-CAC ACA GGC TCA CAT GCC3' 
5'-TCA TGT CCC TCC TCC CAA AG-3' 
5'-CCT ACT CAG GCA TCA GCG-3* 



5'-CATTGC AAA CTC AGG AGA TA-3' 
5 -AAA CTC TGG TCC TGG. CTG-3' 
5*-AAA TTC TAG ACA TCG CCT CTA A-3* 
5'-CAC ACA GCT AGG TTA GAA GGA TC-3* 
5'-CCA GNC TCG GTA TGT TTT TAC TA-3' 
5 '-AAA AAC GTA CTC CCA CAT TC-3' 
5'-AGC CAG CAT TAC CTC TGN TAC C3* 
5'-TTA GCA AAT CCC AAG CAA TA-3' 

5* -GGT GCC AGA CTA TCC AGA CC-3' 
S'-GGC TCT GGG TGT TTC TCC TA-3' 
5'-GAT CGC CTA TCA CCT CCT TG-*3 
5'-TTA ATA AAA ATA CCC CCA CC3 • 
5*-GCG CTC TTG GTA TAT GGT ACA G-3' 
5*-GAA TCT GAA AGG CTC TGC-3' 
5'-TCG CCT GAA TAG ACC ATA AAA A-3' 
5'-CAA CAC CCA AAC AGA TGA CC3 # 

SUBSTITUTE 



5VTAA CAG AGG CAT GAA AAC CA-3* 
5'-AAA CTA GAG TCC TGG CCT GA-3 • 
S'-GCT ACC ATC ACC ACA ATC AA-3 • 
5 # -TCT CTT GCT GAA TTG ACC CT-3* 
5'-CTC AAA CCT CTC TCC AAG CC-3 # 
5'-ACT TCT AGG CCT GTT CTC AG-3' 
5M3AT CAC AGA TAT TCG CCC ATA G-3' 
S'-CTG ATC GTC CTA AAG GCA GA-3* 

5 # -TAT GCT GAT TTA GGG AGC CC-3* 
5'-AGC TCT CAT GNC TTT ACA TTC T-3 • 
S'-GCT GTC TCT GAG ACT TCG CA-3 ' 
5'.GGA AAT AGG TCT GAA CAA AA-3 ' 
5*-TCT GGG CAA CGT CAC TC-3* 
S'-AAA ATT ACA AAG AAG ACC-3 ' 
S'-GCC TGG GTC ACA AAG CA-3 ' 
5VAGT CTT TCA TCG CCA CTC TOO* 
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FIG. 7C-3 
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FIG. 7D-2 

GROUP 4 

A Primer 

S'-GAG GCA CCA GAA TCA CTT-3* 
5--AGA TCA CGG GTA ATC TTG GA-3' 
S'-TTC GCT CTT TCA TAG GC-3' 
S'-CCC CTT GOA AAA TCA CTG-3' 
5'-CCT AAG TAG GCA GTT GGT AT-3' 
5'-AAC TTA CAC ATT TGG CCC TG-3' 
J'-AAC TGC AAC ATT GAA ATC GC-3' 
•S'-TGG AAA CTA TGTATCTTG GAG G-3* 

5' CAT ATG CAT ACC ACA CA0 3 - 
S'-AOC TCA GAG ACA CCT CTC CA-3' 
5*-TCA (3CC TGA GTT TTC TIT AT-3 • 

5'-GGT CTG ATG AAA ATG TTC TCA AGC-3 • 
**-AAC CTC TGC TCG TCA GAG TC-3' 
5--GCC TTG GGG GTA AAT ACT CT-3' 
^TTTTCTTTTTTG CAOTTT ATC C-3' 
5 '" ATC TTC CAA AAA TOT CAT-3' 

5*-GGC CAG GCT TTG TTC AGA-3* 
5'-TTT AGC CTG AAA ATA CAC GC-3' 
r ' TCC ACA AAG GAA CAG GT-3" 
5'-GAT CTG ATT ACT ATT CTC TGC TTG A-3 • 
S -AAA TGT GAG TAG AAG GGA TAG GTT-3' 

S'-GAG TGG CGG TGA GAA GGT AT-3' 

5'-TGG AAT TTC TCC ATG TTG AG-3* 

5--GAA AAG AAT GCT GGA TAG-3' 
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FIG. 7D-3 



B Primer 



T-ATG GTT GTA GAT GAG ACT GG-3' 
S'-AAG CAT CTT AAT GGA TGG AAA-3 • 
S'-ATT TCA TIT GTA ATT TAG TAG CAG-3' 
5'-CCA TGA ATA AGC CTT GCC-3 ■ 
S'-CAC AGC AGG GGT TCATTTTT-3' 
5*-TCA ATC TGT GGA GTC ATT GG-3 • 
S'~QGG ACC ATA GTT CTT GGT GA-3 B 
S'-CCN GGC TTT AGG GTG G-3* 

5*-AAT CTT ATT GCT GTC TCA-3' 
S'-CTC TAT TAG GAT ACT TGG CTA TTG A-3 • 
5'-CAA GGA GCA GGA AGA ACA GC-3* 
S'-TAG ACT GGG TTG TTA GGG ACT CTC-3' 
5'-CGA CTA CGT GCT GGC TAC TT-3* 
5*-GGA ATT ACA GGC CAC TCC TC-3 ' 
S f -CAC TIC AGT GCC TTC TTG AGA*3* 
5'-CAT AAT AGG AGA ATA AG A-3' 

5'-CAG GGT CTA TGA TAC GCT TT-3' 
5 '-GCT TTG CTC CTA GAG TCC AG-3* 
5'-CAT AAT TTG CTG CTT TGG AT- J* 
S'-GCTTTA TAG GAG GTA TCT TTN TGT G-3 f 
5*-TAA AAA AGN CCG ACT AGA CC-3 * 
5'-AGC CAT TGC TAT CTT TGA GG.3' 
S'-AAG AGC TAT GAA AAG AGT TAA AGG A-3 • 
5 f -CCA GTT TTT ATG GAC GGG GT-3' 
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FIG. 7D-4 
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FI G. 7E-I 



Marker Alleles (bp) Heterozygosity Chromosome 
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FIG. 7E-2 

GROUP 5 

A Primer 



5*-AGG TCA TTG AGG TIT ATA TTC CCA-3* 
5*-ATC AGG AGA TGT TGC CTT GC-3* 
5'-AGG CAT ACT AGG CCG TAT T-3' 
S'-CAG ACA ATG GCT TCC AAA ACT A-3' 
S'-CCT GAA GGG TGT AAT TTT CA-3* 
5'-TGA TTG GAG GTG GTA GAG GT-3' 
5*-ATA ATA TCC TTT GAT CCT TTC GCT A-3* 
5'-TTC CTC ATT TAC CTO CAC TAA C3' 

5'-CAC CAT CTC TGT GCT ATT GG-3' 
5MTC TGC ACT CGT TAT GAG AA-3 * 
S'-AAC TAA GAC ACA CAA CCC CG-3* 
S'-CTG CTG GAA CTT AAA ACT GC-3' 
5'-CAA CAG ATC TCC CAA GGT AG-3 ' 
S'-AGQ CTG TCT TGG CAG AAA T-3* 
5'-GAG GGC TGT TGA CCC AC-3* 
S'-TCG GTA AAC ATT CAT CCA GA-3* 

S'-AAA CAA AAT AGC CTT CAA AA-3" 
5 '-TAG GCC CAA GGA ATT NAA AA-3' 
5*-AAA ATG ACT TCT TTG GGT GGG C-3 ' 
5'-TTC GCT GAG ATC ATG CCA C-3* 
S'-AGT GTT TTG AAG GTT GTA GGT TAA T-3* 
5'-ATC TTG GAT TTA GGG TTG GC-3* 
5'-TGT GTC ATT ACQ CTT TTC ATC-3' 
S'-TGC ATT GTT GTC ATG CCT-3' 
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FIG. 7E-3 



B Primer 



5*-GAA CCC TAG GAA GTG AAA TAC AAA A-3* 
S'-CAG GGC TAT GAT TGG ATG TC-3' 
S'-TTC CCA TCA GCG TCT TC-3* 
5'-CAA ACT TAG GGT TGT TCC TCA C-3' 
S'-TGA GAA GGT GTG TTA GGG TG-3 # 
5'-AGC TAT CAT CTA GAA AAG CAG CA-3' 
5'-AAA TTT GGT TAT TTT TAA GCA AAC T-3 # 
S'-TTG CTA AAC CTT GGG TGT GT-3* 

5'-GAC CTA TTT TGG TTA ACA ATT TAG A-3' 
S'-CTG ATG GAG GTT AAG GCA AG-3* 
5'.CCA ATT CAG TGG CAT CTA TG-3' 
S'-AGA AAT GAG ATA TTG TTTTCG C-3' 
r-CTC ATA ACT CAA AAC CTC TG-3 • 
5--GAT GTA ATC CTG TGC TAT GGC-3 f 
5'-TTG CCT GGA AAC CTG GTA-3' 
S'-TGT CAA AAT GGA CCA ATC AG-3' 



5'-GCC TGG TAA GTT GAT AGT GT-3* 
S'-TCA TCA TCA CCA CAA ATG CT-3' 
5'-GTG GGT AGC AAC ACT GTG GW 
5'-AGA CCT TTA GGT TGT TCA TGC TC-3 ' 
5-ATA TCT TTC AGG GGA GCA GG-3' 
S'-GGC TCT GCT CCA TCT TCA TA-3* 
r-TCA AAT GGT TCA GGA GAA AGA-3* 
_ S'-TAA AGT CTC CAT CTT CGA TTG T-3' 
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FIG. 7E-4 

Annealing Labeled 
Temp. Primer 
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FIG. 7F-I 
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FIG. 7F-2 

GROUP 6 

A Primer 



S'-TCA CCC CTA ATA CCC AAA AC-3' 
S'-AGT GGA CAG TTG GTA TCT CA-3* 
S'-ACT GGC CTG GCA GAG TCT-3' 
S'-CAC AAT CAT ATG TNC CAA TT-3' 
5'-CAG TAG GCA GGG GTG G-3* 
5 '.AAT TCA CAA GAC ACA ATC TCA G-3* 
5'-AGC TGA CTT TAT GCT GTT CCT-3 ' 
5'-CAA CAT ACT GCC TCA AAA-3' 

5 f .TTC GGC CAA AAA CAG ACT CC-3* 
5 f -AGT CAC CTT CTC TGT CTC CA-3' 
5*-AAC ATC TIA GGG CAT CCT G-3' 
5--ATC TTT TAT TGT GGG GTG CT^ 
5'-CTG GGC AAC AAG ACT GAA AT-3' 
5VIT3G AAA TAG AAT CCA GGC TT-3' 
5'-GCAACTTTT CTG TCA ATC CA-3* 
S'-GCT ATT CCC ACA AAG GCA-3* 

5*-ATC ATG GGA AGT GCG TGG-3* 
5MHTTCC TGC CAA CCT CTT TC-3' 
5*-GGG CAC AGG CAT GTG T-3' 
5'-CAG AAC TAA TCC CTT CTG GC.3* 
S'-TCC CTA CGT TGC ATT TTA-3' 
5'-AAA TCG CTA GAA AAT GTC CA-3' 
5*-TTTTGG AATTTC TAG CCT CC-3 f 
5-CAA CAG GTC CAG GCT ATG TC-3* 
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FIG. 7F-3 



B Primer 



S'-AAT ATG AAG GGA TGT TGA AT-3' 
S'-TGT GAT CAG CCC AGG AAG AG-3 f 
S'-CAG CCA TIC GAG AGG TCT-3' 
S'-ATT AAA TGT GCA TAC GCA AA-3* 
5'-GGG TGT GTC TGT GTC ACA AC-3' 
5 '-AG A ACT AAA GTT GCC TGT TCN TGT A-3' 
S'-TTT TCC ATG CCC TIC TAT CA-3 ' 
5*-TAC ACA AAA AGG AGG TCA TT-3* 

S'-TGA GAA CIT CCA CAT AGC AG-3' 

5'-AGG CCT CAT TCA AAA TCT GT-3' 

S'-AAT GAT TTA AAA TAG ATT AGG AGC A-3' . 

5'-TGC CCA GAC TTC TCA CCT-3* 

f-CAA ATT CCA CAA AGC CGT-3' 

5'-TCT ATC GTT AAC TTT ATT GAT TCA G-3' 

5*-ACC AAA CTT CAA ATT TTC GG-3' 

5 # -GGC GGA TCA TTG AGT GC-3* 

5--TAA TTA GTT GCT GGT TTG AA-3* 
5*-TTG GGT TCA AGC GAT TCT CC-3 * 
5 9 -GGC TGC ATT CTG AAA GGT TA-3* 
5 f -AGC TTC ATA AAG AGT CTG GAA AAT-3' 
5'-TAC CCA GCC AAA CTA TTA-3* 
S'-TCA CAC CTG GGA ATT AGA AG-3' 
5 f -TGA AAC CCA CAG ATA TTG GC-3* 
5 # .TAT CCA TAC ACA CCA TGC CA-3 ' 
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FIG. 7F-4 



Annealing Labeled 
Temo. Primer 
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FIG. 7G-I 
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FIG 7G-2 

GROUP 7 



S'-GCT AGO ATT ACA GGC ACA T-3 • 
5-TGC TCC ACA TCT TAG GGA GT-5' 
^■CTT TGC AGA ACC CAT GAT TAT GA-3 ' 
*'-AAT TCT GAA GAG CCA AAT CTA A-3' 
5'-AAC AAT TGG GAA ATG OCT TA-3 ' 
S'-TTA TGG CAG CCC AAA TGG ACT A-3' 
S' CAG CCA CAC GCA TAC AW 
5--AGG TTG ATA GAC CAT GGA CAC A-3' 

5--CTT ACT GTG TTG CCC AAG GT-3' 
5 '" TC ™ A <nTTTC TAC ATG GTG TAA T-3' 
5--TAO CCA TGA TAG GAA ATC AAC Or 

5--GTTTACGCCTCATGGArrTA.3- 
5'-CAG AGO TGG TTT TCA GTG GT-3" 
S'-GCG CAA CAC ACT CAG ACT CT-3' 
5'-TATTCC ATA CTT GAA TCT OCT C-3' 
5 '* CTO ATA ATA ^ CCA GGA AGA CAC-3 ' 

5 '* TTCTO0 ^ TGG ATA CTG GT-3' 
5--ATC GGA GAC GTA ATA CAC CC-3' 
5 '" AGA CCA ACT CCC TGC C-3' 

^-TCA AAG ACC CAT ATC AAC CA.3' 
5--ATTTCC TCA GGT CTA AAC CAC CC-3* 
5--AGC TTC TAT CCA ACA GGG 003' 
•S'-AAC AGG CTT GAA AGT CTC TGT C-3' 
5 '- TOA ° C Jfl ^TTGAAGAGG.3. 
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FIG. 7G-3 



B Primer 



5'-AGG CTC CTA CTA CCO TCA C-3* 
5'-ACA CCG CTC AGA AAT CAT ATA A-3 ' 
S'-ATT GCC TTG GAG GGC G-3* 
S'-AGG AAA ATA TAC ACA ACC CAA G-3' 
5'-TAG GTT GTG GTG GGT GTT AC-3* 
J'-GCA OAA TGT TGC CCA AAA CTC A-3 ' 
5 '•ACT TCA GGA ATA GCC TTT ACC3' 
5'-TTT TAT TGT TAT GTG GCT TTC A-3' 

5 '-AGC TCT ATG ATT CAT TTC AAG TTT G-3 ' 
5'-TCC TAA CAT TCT GCT ACC CA-3* 
5'-GAG ATC GTG CAG CAC TTG T-3* 
y-GGG CAC ACA GTC CCA A-3' 
5'-TCA GGG ATA GTT GGT GGG TA-3* 
i'-TGG GAT AGA AGC AAC ACA GA-3" 
5'-TGC ATC ACC TCA CAT AGG TTA-3 ' 
5-TATTGG CCT GAA GTG GTG-3* 

S'-TTT GGA TGC ACA GGA ACT TG-3' 
5 -ATG CTG CTG GTC TGA GG-3' 
5'-CAG CCT CGG AGA AAC G-3* 
S'-GTG CTG AAA AGC GAC ACT TA-3' 
J'-TTA GGC CCA GTC CAC ACT CAA G-3* 
S'-ACC AGA ATG TGA ACG ACC CT-3* 
5'-GCC TAT TTG ATA ATG CTG TAC G-3* 
S'-AOA AGO CAT TAA ATT TTG CA-3' 
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FIG. 7G-4 

Annealing Labeled 
Temp. Primer 
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FIG. 7H-I 
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FIG. 7H-2 

GROUP 8 

A Primer ' 



5'-ATC TCA TGG GAG TAC CGTTG-3' 
5'-AGC TIT GCA GAC AAG GCA AGO • 
S'-GAG TCT CCT AAA TGC TGG GG-3 ' 
5'-GTC CAG GAG GTT GAT GC-3' 
S'.TCT TGT CAC TCT AAC TCC GC-3' 
5'-AGA ATG TGG TCT CAC AAG CC-3' 
S'-GTT CAT AGA GGG ACA AGA CAC AGT-3* 

5'-A1T TGA GAG CAG CGT GTT TT-3' 
5'-GGC ACT TGT AAT CCC CG-3* 
S'-CAA AAA AAT GTT TTA CTA AGC AGG-3' 
S'-TTC ACA ACA GCC AAT CGT AG-3' 
S'-CCC GGC TGT GAA TAT ACT TAA TCC-3' 
S'-AAC TGG TTT TGG TAG TGA GA-3* 
S'-AAC ACT TCG ATG TTC CTT CC-3* 

S'-CCT CAA ACC GGA CAA CTA TTT-3' 
5'-CAA AAA GGC AGA ATG CAG TA-3* 
5'-ATT GGG TTT ACT TGT GCC TT-3' 
5'^CT CCA ATC TGC ACC TGA CT.3* 
S'^SCT CCC GGC TGG TTT T-3* 
S'-TTN CAA CAT AGG TTA TAC GCG-3' 
S'-TGT TGG AGT TAA TGT GCC AT-3" 
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FIG. 7H-3 

Annealing Labeled 

B Primer Temp. Primer 

5'-GAC CCA CAT CAC CAT TAC TG-3* 
5'.TCC CTG CTC ATA ACT CAG CC-3* 
5'-AGC TCC TGC ACA GTT CTT AAA TA-3' 
5'-AGT GCC CAT TTC TCA AAA TA-3* 
f-GGC CCA TOT CrTTIT TAG GT-3' 
5'-AGG GAA TGT CAA TGA AAA CC-3* 
5'-CCA TGA TGTTTG GTT AAT CAC A-3' 



5'-CCA TTA TGG GGA GTA GCG GT-3* 
f-TGA GCC ACT GCA CCT G-3' 
S'-AGG CAT GAC TCA CCG C-3* 
S'-TTC TCA AGG TTC GTC CAT GT-3 ' 
S'-CCC AAC AGC AAT GGG AAG TT-3* 
5*-GAG GTG CCC GCT ACT A-3' 
5'-AGC TGA GAG CGC ATG TAT AA-3' 



f-CAG AGA GCA AGA TCC TAC CTCT 
S'-TCC AGA GTC AAA AAC ACA GG-3* 
5'-CGT GAT TTC ATT TCT TGC TG-3 ' 
5'-TAG GCT TTG TTC TGG GGT TC-3* 
S'-GCA GGA AAT CGC AGG AAC TT-3' 
5'-GGC CCA GTT CAT TTT CTA GC-3* 
5'-TCT TTG ACC CAG ACC TCT AA-3 ' 
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FIG. 71- 1 
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F /& 71-2 

GROUP 9 

A Primer 



S'-AAG TGA TCC ACC TGC CTT G-3 ' 
5 '-CAA TIC TGT TCT AAG ATT ATT TTG G-3* 
5'-GGG GTT GAT TGA ACT TGG TT-3' 
5'-TAC TAA CCA AAA GAG TTG GGG-3* 
5'-AGC AGC AOC AGC CAT ATT GT-3' 
5 , -nT ACC TAA GGC TGG ATC TG-3' 
5'-TCG TGA GAN TAC TGC TIT GG-3' 



5'-AAAACA CCT TAC CIA AAA CAG CA-3' 
5'-ATG TIC AGA AAG GCC ATG TCA TIT G-3* . 
5'-TGC ACC ACA GCA TAC CAG TA-3 • 
i'-CTT GGG GAC TGA ACC ATC TT-3' 
5'-TTT GTG i TO GTC TTT TAT AGG CAT A- 3' 
5*-CCT CAA TGC ACA ACT CCT-3 ' 
5'-CTG ACGA CAG TTT CAG TAT CTC TAT C3' 

S'-GCC TTC ACT AAG CAA TCT CTA AA-3' 

5'-ACT ACC GCC AGG CAC T-3' 

S'-CTG TCG GAT TCC TTA GTG ATA C-3* 

5'-GAA GTA AAG CAA GTT CTA TCC ACG-3' 

5'-CTC GCG CTG GGT ACA GTT AT-3 ' 

S'-TIC ATT TGG AAG ATT TTC AC-3* 

5--AAC ACA CAT ACA AAC ACA CGC AGA T-3* 
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FIG. 71-3 

Annealing Labeled 
B Primer Temp. Primer 

S'-GCC TCT GAG AAT TAG TCT CTG TC-3 * 
S'-CTC TGG CTO AGO AGG C-3* 
5'-CAA GAC CCA TAC CCA TGA-3' 
S'-CTA TCA TTC AGA AAA TGT TGG C-3' 
5'-AGT CAO GCC CAC CCA ATT TA-3* 
5'-CAA AGT TGA CAC TGA TEA TAG CA-3 ' 
S'-TTT TGT CTA GCC ATG ATT GC-3* 



5'-AGATGA TGG TGA GTC CTG AG-3* 
S'-TCC CTA ACG GAT ACA CAG CAA CAC-3" 
5'-AAT OAA CAG CAA AAA CTA AGG GA-3* 
5'-AGC TAC CAT AGG OCT GGA GG-3' 
S'-OGC TCA AAG TGT TTG CAC TG-3" 
5'-CTC AGA CCT GOG TCA AGA TA-3 • 
S'-TTT CCA GAT TTA GGG GTG TAT G-3' 



S'-ACA TGC TCT GAA TCA CCT GA-3* 
S'-CTA AGA TAT GAA AAC CTA AGG GA-3* 
5'-ATA TTC AGA CAA AAG CCA AGT TA.3' 
5'-TCT GTG TAC GTT OAA AAT CCC-3* 
S'-AGA TCA GAG GAG TGG GTT CC-3* 
S'-GGG CCA GAA TGG GTA T-3 ' 
S'-TTC CAG ACA GGA CAG CCT GC-3* 
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FIG. 7J-I 
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FIG. 7J-3 

Annealing Labeled 

B Primer Temp- Primer 

S'-AGT GAA ACT CCG NCC CTA-3* 
5 f -AAC AAA CTT OCT TAT GAG TGT TAC T-3' 
5*-AAA ACA TIT CCA TTA CCA CTG-3' 
S'-GCT GAA GGC TGT TCT ATG GA-3' 
5'-GCA GTT GGG TTA TTT CAA GTC-3' 
5*-CTA CGT ACA TGG CTG CAA*3 f 
S'-CCA TCT TGG TGT GAG GGC-3' 
S'-GCT GAG CAA GGC ATT CTT T-3 f 



S'-ACT GAG GTC ATG CAA GAG GC-3' 

5'-GAG CAA GAC TGC ATC TCA AA-3' 

S'-GTG TCA GGT CGG GGT G-3' 

S'-ACG ATT TCT GGG AGA CTA TAT TGC-3* 

S'-TTC TCA CTG CTT TTC TCT GC-3' 

S'^CCC CTG AAC ACC CTG A-3' 

S'-CCA ACA CCT GAG TCA GCA TA-3' 



S'-ATG TAA CAA AAT GGA GTC GG-3' 
5*-TCC TAA TTC ACT GGG AAA AC-3 # 
5--ATT ACA GGC GTG ACA CAC C-3 # 
S'-GTT TGC CTG GGG ATT GAT TT-3' 
S'-ATA GAC TGT GTA CTG GGC ATT GA-3' 
5'-ATG AAG AAA TAT ATA CAG TGC CG-3' 
5'-CAT GCC TAG ACT CCT GAT CC-3' 
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FIG. 7K-I 



Maricer Alleles (bo) Heterozygosity Chromosome 
SET A 

D5S408 (247-299) 73 % 5 

D9S180 (220-265) 63% 9 

D5S414 (186-206) 82% 5 

DIS304 (168-206) 60% \ 

D6S344 (139-159) 72% 6 

D12S76 (112-124) 71% 12 

D10S219 (89-103) 76% ] 0 

SET B 

D11S906 (291-303) 73% \\ 

D15S121 (258-264) 66% 15 

D5S425 (224-248) 77% 5 

D5S395 (189-213) 81% 5 

D13S217 (160-174) 67% 13 

D2S206 (123-151) 79% 2 

D6S263 (90-114) 81% 6 

SET C 

D14S74 (291-313) 79% j 4 

D20S98 (259-275) 79% 20 

D9S168 (227-247) 75% 9 

D16S421 (206-212) 56% j 6 

D13S173 (166-178) 82% 13 

D8S261 (128-148) 77% 8 

D9S17B ( 9 3 . 99) 66% 9 
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FIG. 7K-2 

GROUP 11 

A Primer 



5'-ACA ACT TCC AAC CCT GAG AT-3' 
9'-CAG TGG TTT GGA ATC GAA CC-3 
S*-GGC CAG TTC AGTCAA GTG-3' 
S'-ACC CTT TTT CCT CCA ATC AT-3 ' 
S'-CTC CAG CCT GGG TCA CTA-3 * 
5*-GGG CTA CAT GAT GAG ACC CT-3' 
5*-TCT TTC TAC CAC CCC CC-3* 

5*-AGCTGG GCA CCG ATA GTA GT-3* 

5'-TTG TAT CAG GGA TTT GGT TA-3' 

5'-CTC CAG CCT GCT GAC C-3* 

5'-GCA GAT GGA AAA CAC CAC TT-3 ' 

5'-ATG CTG UGA TCA CAG GC-3* 

S'-TTA AAA ATT AAG TAG GCT TTT GGT f-3 ' 

S'-CTT AAG GCA AAA TTC TTT TCA ACA C-3* 

S'-CCT GTA CCA CTA CCT GAG TTG AGT-3' 
5*-GAA CTT GCA TAA CCC GAA T-3* 
5'-GGT TTG TOO TCT TTG TAA GO-3* 
5'-ACA TGA ACC GAT TGG ACT GA-3' 
5'.CCC TGT TCC ACT AAT GAT GAC C-3* 
5'-TGC CAC TGT CTT GAA AAT CC-3* 

S'-GAA TAA AAC AGO GTT TGG G-3 ' 
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FIG. 7K-3 



Annealing Labeled 
B Primer Temp. Primer 



5'-ACT GTG CCT AGC CTT CAT TT-3" 
5'-AGC TAT TTT TGG GGG CTG AG-3' 
5'-TGG TTC CAG CAT ATA GCG-3 • 
S'-AGA AGC TGA AAG CTG AGT GG-3' 
S'-CTA ATG CAT GAC AAT AAT ATT TCC A-3 • 
5'-GCG GAG CTT CTT TTC TGT TG-3' 
5'-GCA GAG AAC CTA AAG CAT CC-3* 



5'-GCA CAG GCA AAG ANG AGG TA.3* 

5*-TGT TGT CGC TTC AGT ACA TA-3* 
S'-TCT TGG GCA AGC CAT C-3* 
5'-ACC TGC TGC TGG AAG ATT AC3' 
5'-AAC CTG GTG GAC TTT TGC T-3 • 
5'-GTC CTC ATG TGT TTA TGC TGT-3' 
5'-CTC AAA GTA AGA CCA TAA AAT ACC A-3' 

S'-CTT TGG CTG CCC GAA A-3* 
5'-CAA GGG TAT GTT CCC CAA AA-3 ' 
5'iTGG TTT GTT TGT ATA ACT ATC AT TG-3' 
5'-CCG TTCC CTA TAT TTC CTG G-3 1 
i'-GTC TCT GGC TGC TCT CAA GAC TAT-3' 
5'-TAT GGC CCA GCA ATG TGT AT-3* 

5'-TTT CTC TAA GAA CTT TGG GG-3 ' 
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FIG. 7L-I 

Marker Alleles (bp) H eterozygosity Chromosome 
SETA 



D19S209 


(206-272) 


77% 


19 






92% 


14 






72% 


10 


D12S87 


(142-168) 


79% 


12 


D13S158 


(99-113) 


81% 


13 


SET B 




• 




D11S931 


(251-267) 


73% 


11 


D16S415 


(208-234) 


72% 


16 


D11S925 


(173-199) 


84% 


11 


D16S409 


(135-147) 


70% 


16 


D13S219 


(117-127) 


64% 


13 


D22S284 


(86-102) 


76% 


22 


SETC 








D13S157 


(250-264) 


72% 


13 


D14S78 


(211-233) 


66% 


14 


D13S168 


(173-197) 


76% 


13 


D15S122 


(143-159) 


77% 


15 


D18S70 


(111-126) 


83% 


18 
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FIG. 7L-2 

GROUP 12 

A Primer 



J'-TTC ATT CAC AAA TCN ATG GC-3' 
5'-GCG TGA GTC ACT GTG CC-3* 
5*-CAAAAG TAA CCATTO AGC CC-3' 
S'-CACTAG GTG ATG CTG GAC AT-3* 

S'-GTA CCC ACG GAG TGA AAG AA-3' 



5*-GATTGC TTG AGC CCA G-3 • 
5--CCAGTA ATG TTA TGT AAG TCA ATG C-3* 
5'-AGA ACC AAG GTC GTA AGT CCT G-3' 
5'-TGA ATC TTA CAT CCC ATC CC-3* 
S'-AAG CAA ATA TGC AAA ATT GC-3 ' 
5'-ATG GGT ATT TAA CTT CTC TAC ACA G-3* 



S'-AGCTGA GAA ATC ACA ACA GAG A-3' 
5'-GGC AGG GAT AAG TAT GTC CT-3* 
S'-GCCTAG CCC AGT GGT G-3* 
S'-GAT AAT CAT GCC CCC CA-V 
5'-AAG GCT GAN CTC TAC CG-3* 
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FIG. 7L-3 



Annealing Labeled 
B Primer Temp. Primer 



5'-CTG GAG AGC ATA GAC GNA GA-3 ' 
5'-CAG ACA GAA ATT AAC CAG AGT TGA A3' 
5*-TTG ATA GAA GAA GCG ATA GAT CG-3' 
5*.CTC CAC AAA CAC TTG AAA CA-3' 

S'-GCT TTG ACA ATT TAG CAG CA-3 ' 



5'-GAG AAA TAG TAT GTG TTT GCC-3' 
S'-TAG CCA CTC TAC CCC AGC3* 
5*- iTA GAC CAT TAT GGG GGC AA-3* 
5'-AGT CAG TCT GTC CAG AGG TG-3* 
5'-TCC TTC TGT TTC TTG ACT TAA CA-3 ' 
S'-GCT CTC TTG AGG TCG TTA CA-3* 



S'-TGG AAA TTT GCT GAC AGT AGA T-3" 
5 '-AAA GGT AAC ATC CAA GGG GT-3* 
5'-TGC TTG TGC CTA TGT TCT TG-3* 
5'^CC AGT ATC TGG CAC GTA G-3' 
5'^3GA ATG TCA AGA AGT ACC TAC CAT A-3' 
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F/a 7M-I 



Maricer Alleles (bp) Heterozygosity Chromosome 
SET A 

D13S156 (272-286) 80% 13 

D19S226 (235-263) 84% 19 

D16S422 (188-212) 78% 16 

D18S65 (168-178) 71% is 

D16S413 (131-149) 83% 16 

D20S95 (82-100) . 83 % 2 0 
SET B 

D22S279 (249-258) 73 % 22 

D19S222 (233-241) 65% 19 

D6S281 (203-219) 67% 6 

DI7S808 (147-16-,; 67% J7 
SET C 

P21S260 (267-277) 51% 2 1 

DI9S218 (240-256) 60% 19 

D22S280 (208-220) %\% ^ 

D17S799 (186-200) 68% 17 

DI9S210 (165-177) 73% 19 

D11S922 (88-138) 92% n 
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FIG. 7M-2 

GROUP 13 

A Primer 



i'-ATT AGC CCA GGT ATG GTG AC-3' 
5'-CCA OCA GAT TTT GGT GTT GTC TA-3' 
5'-CAG TGT AAC CTC GGG GCV 
5'-GAG GCA GGA AAT TGC ACT GT-3 1 
S'-ACT CCA GCC CGA GTA A-3' 

5--AAA GCA AGG CTT CGT CTT AA-3 • 



5*-GCG ATC CAG CCT GTG T-3' 

5'-GAAATGTCCTATTTGAAACTGTGC-3' ' 
5--CTG GTA GTG TCA GGC ATG GC3' 

5'-ACC CTA GAC AGO ATG CCA-3' 



f-AGC TGT TCA TGC TTC CAT CT-3' 
•-TTT GCA TTT TCT GGA GTT TT-3' 
'-OCT CCA GCC TAT CAG GAT G-3' 
-ATT GCC AGC CGT CAG TT-3 • 
-TCA CAC TCA CTC GTC TCT CA-3' 
■GQG GCA TCT TTG GCT A-3* 
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FIG. 7M-3 



B Primer 



Annealing 
Temp. 



Labeled 
Primer 



S'-GCT GTG GTA TGA GTT ACT TAA ACA C-3' 
5'-GGT CCAGGA TTT OAA CTA AAG CA-3* 
5'-CTT TCG ATT AGT TTA GCA GAA TGA G-3' 
S'-GCT GGT CTT ACT ATC TCA GGG G-3* 



5'-TTC NTC ATT TTA TTG TGTGCG-3' 



5'-TGT AAA TCG GGT AAG TGA TGC-3' 
5'-CTG TIG AAA TOT ATC GAG TAA ATC G-3' 
5'-CCTATG TTT CAG GCA AAG GC3* 

S'-TGT GGG TTT TCT CAG GTT AT-3" 



5'-AGA GCC CAG AAT ATT GAC CC-3' 
5'-AAT GTC CCT AAA CAC ATG GA-3* 
5'-GAT TCC AGA TCA CAA AAC TGG T-3* 
5*^3AC CAG CAT ATC ATT ATA GAC AAG C3* 
S'-GGT GTG CCT GTG TOT AAA AG-3' 
5'-TCC GGT TTG GTT CAG G-3' 



S'-GGT CAC AGG TGG GTT C-3' 
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FIG. 8 





primer.B' piimerA 
1 2 3 4 S lane 1 2 3 4 S 
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FIG. 9 
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